Method for storing and retrieving data objects

ABSTRACT

The invention applies the economies of scale to data processing and provides automated methods of collecting, categorizing, indexing, storing and retrieving large amounts of data from a wide variety of sources. The invention discloses the use of location and industry templates to categorize documents and other data objects. Location and industry data for each document is translated into a set of assignment numerals that are used to create a header number. Each document or data object is indexed and stored in a data warehouse in accordance with the header number. Subsequently, documents and data objects are retrieved from the data warehouse using the index of header numbers and the location and industry templates.

RELATED APPLICATIONS

[0001] This is a divisional of application Ser. No. 09/973,659, filedOct. 9, 2001.

BACKGROUND OF THE INVENTION

[0002] 1. Field of the Invention

[0003] The invention relates to data processing including automatedmethods of collecting, categorizing, indexing, storing and retrievinglarge amounts of data from widely diverse sources. It includes advancesin the art of screening, refining and formatting data and an automatedmethod of storing and retrieving the refined data by location, industry,document number, etc. and getting the retrieved documents to therequesting party.

[0004] 2. Description of the Related Art

[0005] The explosion of knowledge the past twenty years has resulted inthousands of storage devices in diverse locations throughout the worldthat contain millions of documents in virtually all fields of endeavor.In addition, there are thousands of locations connected to the internetthat make available other millions of documents relating to information,intellectual properties, educated opinions, educational material, andsimilar assets, hereinafter broadly referred to as “Data”. Thedevelopment of economical high-speed computers with high volume storagecapacity led to an explosion of the information technology industry. Anatural outgrowth of this development was the Internet and Web Sitesthat linked thousands of databases and made millions of documentsavailable to the Internet and other public data storage and transportsystems. Techniques for polling and retrieving data contained in thesedatabases are well known in the industry. In addition, the capability ofusing Key Words to search through the Internet and other data transportmedia through various types of search engines such as Web Browser,Yahoo, Google, GoTo, Inktome, Alta-Vista and others is also well known.

[0006] Virtually every library, college, research organization,hospital, political entity, weather bureau, large business etc. havelocations on the Internet. Most have made large amounts of dataavailable to the public. The net effect is a huge amount of dataavailable in the public domain that covers virtually every phase ofhuman endeavor. Another huge reservoir of data is available fromthousands of sources for a fee for access to the database. A third, andlargely untapped source is printed matter, professional publications,magazines, libraries etc. The business world has long recognized thatinformation and knowledge is power and has tangible value. We use thatpower to gain a competitive edge and/or reduce operating costs. The costof searching, downloading, screening, storing, formatting and making theintelligence available to humans has become very expensive. In addition,the sheer mass of data dictates long hours of downloading, reading andrewriting to sort out what is needed from the extraneous data. For anindividual or business trying to gather, store and use information, themass of data has become overwhelming.

[0007] Because of the high cost of collecting, storing and retrievingdata, it has a negative value until it is available to humans forintuitive analysis. The following analogy will make this fact clear.Consider a library book, where a human, highly competent in somespecific area of knowledge, reduces his knowledge to writing andpublishes a book. If no other human knows where the book is, there is notransfer of information and the book is of little value. Eventually itfinds its way into a library where it is categorized, indexed by title,subject matter, and author and assigned to a specific aisle, shelflocation and shelf position. A person searching for this information mayhave had to search through many libraries, bookstores, publishinghouses, industrial publications and educational institutions beforefinding what he/she needed. This search grows more expensive and timeconsuming as the number of databases and volume of information growsexponentially. When the book is located, either electronically orverbally, the library has an organized method of locating theinformation requested. The applicant is told that the information isstored in aisle xx, cabinet xxx, shelf x, and is given the title name.These activities are expensive and time consuming and therefore have anegative economic value until the applicant is able to use theinformation contained in the book. The Supermine™ system makes thecollection, storage and distribution of data economical and readilyavailable.

[0008] In addition to knowledge being power, another axiom of thebusiness world is that B time is money. Where there was once a scarcityof information and data there is now a huge glut of material onvirtually every subject one can name. When the need arises for abusiness to obtain facts relating to a specific problem or objective,employees are required to search hundreds of sources and spend hoursscreening, reading and collating to find the information needed. Morehours are needed to assemble the data into usable form. Several daysspent in this process may mean the difference between success andfailure.

[0009] A need has thus developed for a system of economically utilizingthe vast amount of intelligence and knowledge that exists worldwide.This need entails a means of gathering the data in mass amounts,screening, filtering, categorizing, indexing and storing it with minimalhuman intervention. In addition, a means of making the data available bylocation, type information, industry application, and by the level ofdetail requested by humans is needed. Further, the data must be madereadily available to people from widely diverse locations usingdifferent languages, and it must be done without time consuming clericaloperations. That capability does not exist in industry today.

[0010] The agriculture industry is used in this disclosure to provide asimplistic explanation of the design and flexibility of the Supermine™system. However, a comprehensive world-wide system could be built bycoordinating the Supermine™ system as described in this disclosure withthe developing Satellite Global Positioning System. By using theconcepts taught in this disclosure, the Mercator Projection Charts couldbe used as the basis for constructing Supermine™ system LocationTemplates using Latitude and Longitude data from the Global PositioningSystem to describe precise locations instead of the names of Nations,States and Cities. This system provides a virtually unlimited potentialfor collecting, classifying, indexing, storing and retrieving data thatcan be efficiently associated with precise locations. This disclosureteaches methods of associating any industrial data with any of theseprecise locations. The methods of accomplishing this and the uniqueflexibility of the Supermine™ System is described and demonstrated inthe disclosure. The present invention discloses a system that solvesmany of the problems inherent in the present state of the art.

SUMMARY OF THE INVENTION

[0011] In one embodiment, the present invention provides a method ofstoring data objects in a data warehouse including receiving a dataobject, identifying a geographic location to which the data object isrelated, associating a numeric representation with the geographiclocation, identifying an industry to which the data object is related,associating a numeric representation with the industry and indexing thedata object in the data warehouse based on a header number, wherein theheader number includes the numeric representations of the geographiclocation and industry. One feature of this embodiment may be providing alocation template having a plurality of columns, the columnscorresponding to nations, states and cities. Further, the embodiment mayinclude searching the data object for a term stored in a column of thelocation template, the term representing a specific nation, state, orcity. Another feature of this embodiment may be providing an industrytemplate having a plurality of columns, the columns corresponding toindustry names and industry functions. Yet another feature may besearching the data object for a term stored in a column of the industrytemplate, the term representing a specific industry name or industryfunction. Still another feature may be storing the numericrepresentations of the geographic location and the industry in specificslots in a register designated for the header number. Other features ofthis embodiment include providing location templates associated withdifferent languages and providing industry templates associated withdifferent languages. This embodiment has other features, such as theheader number including a unique document number or a numericrepresentation of a date on which the data object was received. The nextfeature may be storing the data object in a location of the datawarehouse that is associated with the header number.

[0012] In a second embodiment, the invention discloses a method ofretrieving a data object stored in a data warehouse including receivinga request for the data object that is stored in the data warehouse,parsing the request to identify a geographic location to which the dataobject is related, associating a numeric representation with thegeographic location, parsing the request to identify an industry towhich the data object is related, associating a numeric representationwith the industry, generating a header number that includes the numericrepresentations of the geographic location and the industry, searching aheader number index of the data warehouse for the header number,identifying the data object based on the header number, and retrievingthe data object from the data warehouse. One feature of this embodimentmay be providing a location template having a plurality of columns, thecolumns corresponding to nations, states and cities and searching therequest for a term stored in a column of the location template, the termrepresenting a specific nation, state or city. A second feature may beproviding location templates associated with different languages andsearching the request by utilizing a location template associated with aspecific language identified in the request. Another feature may beproviding an industry template having a plurality of columns, thecolumns corresponding to industry names and industry functions andsearching the request for a term stored in a column of the industrytemplate, the term representing a specific industry name or industryfunction. Yet another feature of this embodiment may be providingindustry templates associated with different languages and searching therequest by utilizing an industry template associated with a specificlanguage identified in the request. This embodiment may also include thefeature of searching the request for a first matching term in a firstlist of terms, associating a first numeric representation with the firstmatching term, and searching the request for a second matching term in asecond list of terms, wherein the searching for the second matching termis limited to a subset of terms, the subset of the second list of termsbeing associated with the first matching term. The next feature mayinclude the first list of terms comprising a column corresponding tonations in a location template, and the second list of terms comprisinga column corresponding to cities in a location template; or the firstlist of terms comprises a column corresponding to industry names in anindustry template, and the second list of terms comprises a columncorresponding to industry functions in an industry template. Thisembodiment may also include a feature wherein the header number furtherincludes a unique document number or a numeric representation of a dateon which the data object was received. The next feature of thisembodiment may be retrieving the data object from a location of the datawarehouse that is associated with the header number.

[0013] In a third embodiment, the invention discloses a computer systemcomprising a microprocessor, a storage device coupled to themicroprocessor, the storage device adapted to store software routines,and a software routine stored on the storage device to be executed bythe microprocessor, wherein the software routine comprises instructionsto perform a method of storing data objects in a data warehouse. Saidmethod comprises, receiving a data object, identifying a geographiclocation to which the data object is related, associating a numericrepresentation with the geographic location, identifying an industry towhich the data object is related, associating a numeric representationwith the industry, and indexing the data object in the data warehousebased on a header number, the header number including the numericrepresentations of the geographic location and the industry.

[0014] In a fourth embodiment, the invention provides a computer systemcomprising a microprocessor, a storage device coupled to themicroprocessor, the storage device adapted to store software routines,and a software routine stored on the storage device to be executed bythe microprocessor, wherein the software routine comprises instructionsto perform a method of retrieving a data object stored in a datawarehouse. Said method comprises receiving a request for the data objectthat is stored in the data warehouse, parsing the request to identify ageographic location to which the data object is related, associating anumeric representation with the geographic location, parsing the requestto identify an industry to which the data object is related, associatinga numeric representation with the industry, generating a header numberthat includes the numeric representations of the geographic location andthe industry, searching a header number index of the data warehouse forthe header number, identifying the data object based on the headernumber, and retrieving the data object from the data warehouse.

[0015] In another embodiment, the invention discloses a storage devicereadable by a machine, tangibly embodying a program of instructionsexecutable by the machine to perform a method of storing data objects ina data warehouse, said method comprising receiving a data object,identifying a geographic location to which the data object is related,associating a numeric representation with the geographic location,identifying an industry to which the data object is related, associatinga numeric representation with the industry, and indexing the data objectin the data warehouse based on a header number, the header numberincluding the numeric representations of the geographic location and theindustry.

[0016] In still another embodiment, the invention provides a storagedevice readable by a machine, tangibly embodying a program ofinstructions executable by the machine to perform a method of retrievinga data object stored in a data warehouse, said method comprisingreceiving a request for the data object that is stored in the datawarehouse, parsing the request to identify a geographic location towhich the data object is related, associating a numeric representationwith the geographic location, parsing the request to identify anindustry to which the data object is related, associating a numericrepresentation with the industry, generating a header number thatincludes the numeric representations of the geographic location and theindustry, searching a header number index of the data warehouse for theheader number, identifying the data object based on the header number,and retrieving the data object from the data warehouse.

[0017] In yet another embodiment, the invention discloses a method ofoperating an electronic switch including receiving a plurality of dataobjects, storing the data objects in a plurality of data comparitors,receiving a first signal indicating that all of the comparitors arebusy, receiving an additional data object, providing a holding area fordata objects, storing the additional data object in the holding area,receiving a second signal indicating that a comparitor is free, andstoring the additional data object in the comparitor.

[0018] An objective of Supermine™ is to have the capability ofcollecting seemingly unrelated data objects, such as documents, inwholesale amounts, categorize the data objects by geographic location,industry, and fields of study or specific functions and make themeconomically available as needed. Another objective of the disclosedinvention is that the received data objects be categorized, screened toeliminate extraneous and duplicated data and indexed to enable rapidstorage and retrieval. A further objective of the disclosed invention isthat the retrieval mechanisms enable Industry Analysts, Supermine™ staffand qualified clients to easily extract data objects or documentscontaining data indexed by location, language, industry, function, orfield of study with little intervention by humans. These objectives areaccomplished via the following series of operations exclusive toSupermine™. These and many other objectives and advantages of thepresent invention will be apparent from the claims and from the detaileddescription presented below in connection with the accompanyingdrawings.

DESCRIPTION OF THE DRAWINGS

[0019]FIGS. 1A-1C show is a block diagram of the Supermine™ systemshowing the data acquisition components and the data flow through thesystem in accordance with an embodiment of the invention.

[0020]FIGS. 1A-1 to 1A-3 show a block diagram showing how the Supermine™System reacts to a client's request for data, retrieves stored documentsfrom the Supermine™ Storage Warehouse and delivers them to clients,Supermine™ staff and analysts in accordance with an embodiment of theinvention.

[0021]FIG. 2 is a table depicting an illustrative Location Template inaccordance with an embodiment of the invention.

[0022]FIG. 3 is a table depicting an illustrative Industry Template inaccordance with an embodiment of the invention.

[0023]FIG. 4 is a table depicting the placement of numerals derived fromthe names of locations and industries into specifically assigned scanline slots in a Comparitor Match Buffer in accordance with an embodimentof the invention.

[0024]FIGS. 5A-5B show is a table depicting a Location and Industry KeyWord Template Storage Unit in accordance with an embodiment of theinvention.

[0025]FIGS. 6A-6B show a block diagram showing a method of moving anewly acquired document into an assigned location and industry warehousestorage position in accordance with an embodiment of the invention.

[0026]FIG. 7 is an illustrative data search questionnaire in accordancewith an embodiment of the invention.

[0027]FIG. 7A is an illustrative data search questionnaire, includinganswers, in accordance with an embodiment of the invention.

[0028]FIGS. 8A-8B show a table depicting an illustrative DocumentRetrieval Control Unit in accordance with an embodiment of theinvention.

[0029]FIGS. 9A-9B show a block diagram of an illustrative-Supermine™Scan Control/Comparitor in accordance with an embodiment of theinvention.

[0030]FIGS. 10A-10B show a three-dimensional table depicting anillustrative data warehouse in accordance with an embodiment of theinvention,

[0031]FIG. 11 is a block diagram showing an illustrative Supermine™Document Distribution Control in accordance with an embodiment of theinvention.

[0032]FIGS. 12A-12B show a block diagram showing an illustrativeDocument Distribution Center in accordance with an embodiment of theinvention.

[0033]FIGS. 13A-13C show a block diagram showing an illustrative DataReceiver/Key Word Comparitor in accordance with an embodiment of theinvention.

[0034]FIG. 14 is a block diagram showing an illustrative Supermine™inventory storage facility in accordance with an embodiment of theinvention.

DETAILED DESCRIPTION OF THE INVENTION

[0035] The present invention provides a method of applying the economicsof scale to the information technology and data fields just as theeconomies of scale and volume are utilized in other industries such asmanufacturing, sales or transportation. It teaches a method ofelectronically capturing, cataloging, indexing, storing, formatting andretrieving large volumes of raw data from thousands of diverse sourcesand locations with little need for human intervention. The inventionaccomplishes this by providing a unique means of electronically“reading” a document incoming from a semi-automated polling unit andgathering enough information to determine the document source, subjectmatter and language. These data are electronically translated into a setof assignment numerals to place documents in a data storage warehouse ordirect them to an industry analyst or client as desired.

[0036] Each Supermine™ warehouse storage location contains stored datarelating to a particular Nation, State or Province, City and specificlocations in that city as desired. Each nation storage location containsthe assignment methodology and storage capacity to store and retrievedata concerning up to 999 different industries within a State withinthat Nation, i.e. the oil and gas industries, telecommunications,timber, pharmaceuticals, manufacturing, transportation, trucking,hospitalization, medical, agriculture, construction,accounting/bookkeeping and similar endeavors. The individual storagepositions for each industry within the warehouse are capable of storingvast amounts of data in a manner enabling rapid and economicalretrieval. This is made possible by a unique digitally indexed templatethat serves as a “warehouse-loading scheme” and, in conjunction withother components, provides a menu of available documents and where theyare located within the storage warehouse. The templates also provide ameans of searching through other storage locations for data that mayrelate to information in other fields.

Block Diagram Description of the Supermine™ System

[0037] A preferred embodiment of the method and system of the presentinvention is set out below. It should be understood that many of thesensing, timing, copying, deleting, document moving and controlfunctions are done in software. Sufficient schematics of significantcomponents, bit registers, buffers, look-up tables and gating circuitsare shown in the drawings to enable understanding the concepts,configuration and operation of the Supermine™ system. FIGS. 1A-1C and1A-1 to 1A-3 comprise block diagrams of a system enabling theapplication of the economics of scale to the information technologyindustry. The Supermine™ mass data gathering process begins with asemi-automatic polling unit (1) that responds to commands from a datagathering team (2) and from any of a multiplicity of Industrial Analysts(3) who are experts in different industry disciplines.

[0038] Each Supermine™ analyst or staff member has two methods ofaccessing and activating the programmable polling unit (1). The firstmethod is used to search for a document known to relate to a specificlocation. This is done by way of the direct connection from the analyst(3) to the polling unit (1). The second method is used when the analystneeds new data from a particular location within a Nation, State or Cityrelative to a specific industry or field of study that is not alreadycontained in the Supermine™ data warehouse. The analyst selects thetemplate needed from the Location a n d Industry Word Template Storage,FIGS. 1A-C, (8) or FIGS. 5A-5B, and copies it into one of a multiplicityof Search Template Registers (4) that serve the polling unit (1). Theanalyst can then direct the polling unit (1) to search for documentsrelating to the data on that template. In addition, clients who havepurchased access and have been issued proprietary passwords and routingcodes can access Supermine from remote locations. The Routing CodeScreening Unit (5) controls the level of access to Supermine™. A seriesof Incoming Protocol Converters (6) are provided for customers usingdifferent computer protocols and machine languages. These convertershave an incoming section and an outgoing section to prevent transmissiondelays. When a customer makes an initial request for data, the IncomingProtocol Converter (6) recognizes the protocol used and converts it intothe machine language used by Supermine™. The incoming request is routedto a Security and Screening Unit (7) that returns a query menu, FIG. 7,to the customer that, when answered, (See FIG. 7A) confirms the client'sidentity, location, the level of access the customer has purchased, theaddress where the data is to be returned and the preferred language.When the security, billing and the data request have been satisfied, thecustomer is routed to another security gate requiring Daily Passwordsand Routing Codes (5).

[0039] When the customer satisfies these security measures he has fouroptions depending on the level of access to Supermine he has purchased.First, he can choose to speak to an individual Industry Analyst (FIGS.1A-1C, (3)) for data gathering information and assistance in polling theworld's Internet and other sources to locate the specific data needed.Alternatively, he may directly activate the polling unit FIGS. 1A-1C,(1) to search the Internet for the data by inputting his answers to theData Search Questionnaire (FIG. 7A) directly into the Search Templateregister (4) of the programmable polling unit (1). Otherwise, he can usea template provided by Supermine™ that is specific to his data requestand input it directly into the Document Retrieval Control Unit, FIGS.8A-8B (34), to search through Supermine™ for the data needed. Finally,he can complete the Supermine™ menu questionnaire and copy it into theDocument Retrieval Control Unit, FIGS. 8A-B (34), to retrieve documentsdirectly from the Supermine™ warehouse.

[0040] Referring to FIGS. 1A-1C, Data received as a result of pollingthe Internet and other sources is not presented to a hard copy printeror to a human as is done in the present state of the art. Instead, theincoming raw data is presented to one of a multiplicity of DataReceiving/Transmitters, (9) that temporarily accept the incomingdocuments, assigns a unique Supermine™ number and the date received.These identifying data are the beginning of a “header” that will staywith the document as long as it remains in Supermine™. The DataReceiver/Transmitter (9) then transmits the data into one of twoparallel paths. This “splitting” process is controlled by a BusyTransfer Switch, (12) that directs the incoming data into a DataReader/Key Word Comparitor, (11), FIGS. 13A-13C, or into the IncomingDocument Holding File, (10) depending on the dynamic data load presentedto the Data Reader/Key Word Comparitor, (11).

[0041] Another source of data that will reside in Supermine™ comes fromprinted matter, news articles, publications from Universities andMedical Schools, Trade Publications etc. A number of high speed OpticalScanners (13) have been provided to provide efficient inclusion of thesedata into Supermine™. The output of the optical scanners iselectronically presented to the Data Receiver/transmitters (9), andother components, in digital format and is processed the same as datafrom other sources. This substitute method is used for hard-copydocuments, news and industry journal articles that are to be stored inSupermine™.

[0042] The disclosed system is designed to poll, receive, index, storeand retrieve huge volumes of data from multiple sources throughout theworld. The system design is to provide Data Receiving/Transmitters (9)and other components in sufficient multiples to continually accept anddisburse data from and to all sources without delays. As previouslymentioned, the incoming document is assigned a “header” and transmittedto the Data Receiver/Key word Comparitor (11), or FIGS. 13A-13C. Therethe document is “read” electronically and, as the reading progresses,each word is compared with the key words of templates that have beeninserted into the comparitor. Since the holding time per document variesas to the number of words in the document, there is a possibility thatthe Data Receiver/Key Word Comparitors (11) may become overloaded. Tosolve that problem, a Busy Transfer Switch (12) is provided.

[0043] The operation of the Busy Transfer Switch (12) is as follows.When the last available Data Receiver/Key Word Comparitor (11) is seizedfor an incoming document, a signal is sent to the Receiver Busy SensingBuffer (19) which sends a command to the Busy Transfer Switch (12) toswitch the next incoming document to the Incoming Document Holding File(10) where it is stored in the order assigned by the DataReceiver/Transmitter (9). The Incoming Document Holding File (10) isdesigned to automatically move the lowest numbered document in its fileto the Document Resend Buffer Transmitter (14). A signal is sent to theBusy Transfer Switch FIGS. 1A-1C (12) when an idle receiver is detectedby the Receiver/Register Sensing Buffer FIGS. 1A-1C (19). This switch isprogrammed to “look” into the Document Resend Buffer/Transmitter (14)and trigger it to send any document in the buffer to the input of thenext available Receiver/Comparitor (11) for further processing.

[0044] At this point all incoming data presented to the DataReceiver/Transmitters (9) continue to be sent to the incoming DataDocument Holding File (10) in the manner previously described as long asthere is a document held in the Document/Resend Buffer Transmitter (14).When the last document stored in the Document/Resend Buffer Transmitter(14) has been sent to the Data Receiver/Key Word Comparitor (11) thusindicating that the Incoming document Holding File (10) is empty, theDocument Resend Buffer/Transmitter (14) sends a command to the BusyTransfer Switch (12) to again open the path from the Internet or OpticalScanners (13) directly into the Data Reader/Key Word Comparitors (11).This condition remains in effect until the last available DataReceiver/Key Word Comparitor (11) is seized and the process justdescribed is repeated. The above-described sequence provides a means ofmaintaining continuous data acquisition and serves as a “Traffic Cop” toassure that no data is lost because of overloaded Data Receiver/Key WordComparitors (11). This completes the explanation of how data arereceived from the Internet or the Scanners (13) and sent to the DataReceiver/Key Word Comparitors (11), or FIGS. 13A-13C.

[0045] The process of electronically reading incoming documentspresented to the Data Reader/Key Word Comparitor (11), or FIGS. 13A-13C,will now be described. Referring to FIG. 1, the DataReceiver/Transmitter (9) receives the incoming document from the pollingunit (1) or from the Optical Scanners (13) where a unique Supermine™document number and the date received is recorded as a “header” for thedocument. These data are put into the header in a specific order that isstandard for all Supermine™ registers, operations and functions. TheReceiver/Key Word Comparitor register (21), a component of the DataReceiver/Key Word Comparitor (11), is programmed to read this headerthat will become a permanent part of the document. At this point in theprocess, the particular industry template that will “match” the incomingdocument is unknown. For that reason, the industry template number isnot added to the header until document selection is complete.

[0046] The process of segregating incoming documents by location beginswith the use of the location templates, shown in FIG. 2. In like mannerthe process of identifying documents by language, industry, fields ofinterest within the industry, and also by functions, operations or otherdetails begins with the Industry Templates Figure (3). Referring toFIGS. 2 and 3, as each Location and Industry Template is constructed bythe Analyst, (3), it is copied into the Location and Industry WordTemplate Storage Unit, FIGS. 1A-1C (8), or FIG. 5 in the proper languagecolumn by commanding the Template Control Unit (17) to save thetemplates into the column for the language used in the template. TheTemplate Control Unit (17), in conjunction with the various componentsof the Data Receiver/Key Word Comparitor (11), enables Supermine™ toassociate any Location Template, FIG. 2, with any Industry Template,FIG. 3. Further, the design of the Location and Industry Key WordTemplate Storage Unit (8) makes it possible to associate any locationtemplate with any industry template in the language most commonly usedat the location that was detected by electronically “reading” thedocument.

[0047] Referring to FIGS. 1A-1C, the Data Receiver/Key Word Comparitor(11) receives the incoming document as previously described. Thecomparitor section of the Data Receiver/Key Word Comparitor (11) iscomprised of a Receiver/Incoming Document register (18) that accepts theincoming document, and a Receiver/Key Word Comparitor register (21) thatcontains the template with the key words that are “matched” againstwords in the incoming document to determine subject and origin of theincoming document. Also included in the Data Receiver/Key WordComparitor (11) are Comparitor Match Buffers (22, 23-“n”). ComparitorMatch Buffers 22 and 23 are “paired” registers that store the numeralsthat correspond to the matching words found by the Receiver/Key WordComparitor (21). The purpose for paired buffers is to prevent storingpartial documents or those referring to duplicate industry functions. Italso solves the problem of different locations having the same names,i.e. Pasadena, Tex. and Pasadena, Calif. The Buffers are configured toaccept the primary matching words in the first buffer, number 22, withall extraneous matching words going into buffer number (23). Buffers 24and 25 are paired as are buffers 26 and 27 etc. These registers areintegral parts of the Data Receiver/Key Word Comparitor (11) and areprovided in sufficient multiples to process all incoming documentswithout delay. Primary matching words are defined as those having theproper relationship with the template column and template scan linewhere the matching word was detected.

[0048] When document “reading” is complete, the content of buffer number(22) is sent to Supermine™ storage. The content of buffer (23) is sentto the Document Review file FIGS. 1A-1C, (38) where it is reviewed bythe Data Gathering Work Teams FIGS. 1A-1C, (2) and either madeacceptable for storage or deleted. A Template Control Unit (17) isdesigned to receive signals from the Data Receiver/Key Word Comparitor(11) and respond by copying and transporting templates and documentsbetween various Supermine™ components. It serves both types of templateswith the same register equipment. This begins with copying the locationtemplate from the template storage unit (8) into the Receiver/Key WordComparitor Register (21), a component of the Data Receiver/Key WordComparitor (11).

[0049] The incoming document is received into the Incoming DocumentRegister (18). When the complete document has been stored in theregister, a “document complete” signal is sent to the Receiver/RegisterSensing Buffer (19) which generates “location template start” signalinto the Template Control Unit (17) which activates the Locationtemplate Transport Control (20) causing it to enter the LocationTemplate Storage file (8) at position one of the location templatestorage unit. For this disclosure, the template stored in locationtemplate position one of the Template Storage Unit (8) is numberfifty-seven (057). This is the number of the nation storage location forthe United States (FIG. 2). The Location Template Transport Control (20)is programmed to copy the numbers “057” into the Nation scan line slotsin the Receiver/Comparitor Match Buffer (22) and then copy the entirecontent of Location Template 057 into the Receiver/Key Word Comparitor(21), a component of Data Receiver/Key Word Comparitor No. 1 (11).

[0050] When the complete incoming document is stored in the IncomingDocument Register (18) and the Location Template 057 is stored in thereceiver Key Word Comparitor Register (21) the search for matching keywords begins. Each word in the incoming document is “read” andelectronically compared with words on the template. Each word that isfound in the incoming document that is also on the template istranslated into numerals obtained from the Location and Industry KeyWord Template Storage Unit (8), or FIGS. 5A-5B.

[0051] When the comparitor completes scanning, translating andregistering all locations mentioned in the incoming document, a“location scan complete” signal is sent from the Receiver/Key WordComparitor (21) to the Receiver/Register Sensing Buffer (19). Thiscommands the Receiver/Register Sensing Buffer to scan the locationregister slots of the Receiver/Comparitor Match Buffers (22 and 23) todetermine whether there are numbers in all location slots in at leastone of those two registers. If so, the Receiver/Register Sensing Bufferuses that information to determine that location scanning is complete.The Receiver/Register Sensing Buffer (19) temporarily stores the numbers057 and sets a “location scan complete” bit. It then signals theTemplate Control Unit (17) to select and delete the location templatecopy in the Receiver/Key Word Comparitor (21). The Receiver/RegisterSensing Buffer (19) senses the empty register in the Receiver/Key WordComparitor (21) and signals the Template Control Unit (17) to load theIndustry Template into the Receiver/Key Word Comparitor (21). Thisprocess results in selecting an Industry template that is written in thelanguage of the location detected when the incoming document wassearched for location words.

[0052] The Receiver/Register Sensing Buffer (19) is programmed to sensethe empty register in the Receiver/Key Word Comparitor (21) and send acommand to the Template Control (17) to activate the Industry TemplateTransport Control (25). It also sends the Location numbers “057” to theTemplate Control Unit (17) causing it to command the Industry TemplateTransport Control (25) to enter the Location and Industry TemplateStorage Unit FIGS. 5A-5B at the Language Column 1 and scan through theLanguage Column Register (31) looking for the Location numbers “057”.For the purposes of the explanation, it finds the numerals “057” inLanguage Column 1, FIGS. 5A-5B, (30), the English Language version ofall Industry Templates. The Industry Template Transport Control (25)sets the template counter to prevent selecting that template again untilall others have been scanned. It then copies the first template listedin column 1 into the Receiver/Key Word Comparitor Register (21) where ascan search is made looking for any matching Key words. This results inthe document search being made with templates using the Englishlanguage. Scanning, word matching and converting key words into numeralsthat are then inserted into the Comparitor Match Buffers (22-23) for theIndustry words in the incoming document proceeds just as in the scanningfor location words.

[0053] Circuitry and programming is provided to copy the number of thetemplate that “matched” the incoming document into the template segmentof the header register when the industry scanning and word matching iscomplete. Provision is made in the program to combine the header fromComparitor Match Buffer (22) and the text from the Incoming DocumentRegister (18) and copy them into the Document Assignment Control FIGS.1A-1C, (24) or FIGS. 6A-6B. There the header is inserted into AssignmentNumber Register, FIGS. 1A-1C, (24-a) and the document text is insertedinto the Document Text register FIGS. 1A-1C, (24-c). This is a pairedregister combination that makes it possible to combine the header andthe text into one document or to transmit then as separate documents.This is done to facilitate populating the remote Supermine™ Warehouseand the remote Supermine™ inventory of FIGS. 1A-1C, (39) or FIG. 14.

[0054] When the document header has been received into the AssignmentNumber Register (24-a) and the text has been received into the DocumentText Register (24-b), “document received” signals are sent to theDocument Assignment Control FIGS. 1A-1C, (24). This causes the DocumentAssignment Control to generate a command to the Document TransportControl FIGS. 1A-1C, (28) to copy the combined document and header intothe Warehouse Storage Assignment Buffer FIGS. 6A-6B (29) where it isfurther processed into the correct Warehouse Storage Location. When thedocument and header were received into the Warehouse Storage Assignmentbuffer FIGS. 6A-6B (29), a “document received” signal was transmitted tothe Document Assignment Control FIGS. 1A-1C, (24). This generated acommand from the Document Assignment Control FIGS. 1A-1C, (24) to theDocument Transport Control FIGS. 1A-1C, (28) to copy the combinedcontent of the Assignment Number Register (24-a) and the Document TextRegister (24-b) into the off premise Supermine™ Warehouse Storage UnitFIGS. 1A-1C, (39) which is an exact duplicate of the on premiseWarehouse.

[0055] A “document received” signal was returned to the DocumentTransport Control FIGS. 1A-1C, (28) from the off premise Supermine™Warehouse Storage Unit FIGS. 1A-1C, (39). This triggered a command tocopy the content of the Assignment Number Register FIGS. 1A-1C, (24-a)into both the off premise and the on premise inventory units, FIGS.1A-1C, (37,39). This populated the on premises Warehouse and Inventoryand the off premises versions of both those units.

[0056] When the header and text of the document was removed from theComparitor Match Buffer (22) the text stored in the Incoming DocumentRegister (18) and the header stored in the paired Comparitor MatchBuffer (23) was also copied into the Document Review and Control UnitFIGS. 1A-1C, (38) where all incomplete, duplicated or questionabledocuments are sent for review. There the Data Gathering work teams makethe decision whether to complete the document and store it in Supermine™or to delete it. This completes the explanation of block diagram FIGS.1A-1C that describes, in general terms, the data acquisition, electronic“reading”, the method of indexing, assignment and storing documents inthe Supermine™ Warehouse. The above identification, assignment, indexingand storage system is unique to Supermine™ and forms the basis forretrieving documents containing specific data from among thousands orperhaps millions of stored documents and delivering those documents tothe person who requested them.

Full Description of the Supermine™ System

[0057] Block Diagrams 1 and 1A described in general terms how theSupermine™ System can randomly or selectively poll the Internet andother locations to acquire, electronically “read” the documents todiscern origin, subject matter, language etc. and assign a controlnumber to the document. Further the Block Diagrams have described howthat document is processed through the Supermine™ System to store it ina Supermine™ Warehouse, retrieve it from the Warehouse and deliver it toa customer who has requested the document with no human interventionexcept downloading the retrieved document to the client. Since puttingdocuments into the Supermine™ Warehouse, and retrieving specificdocuments from that warehouse is the focal point of most of thisdisclosure, the explanation should begin with the Warehouse itself.Referring to FIGS. 10A-10B, a word picture will be helpful to appreciatethe size and configuration of the data warehouse that would be requiredto store hard copies of the data it contains. Visualize a hypotheticalwarehouse with 1000-foot high ceilings covering two square miles.

[0058] Referring to FIGS. 2 and 10A-10B, assume that within thatwarehouse there are 399 freestanding shelf structures 100 feet wide and1000 feet high each named for a nation. Aisles ten feet wide separatethe shelf structures. The 399 separate nation shelf structures arenumbered alphabetically from Afghanistan (001) through Zaire (399). Thisnumber listing corresponds to column 1 of FIG. 2 that is a locationtemplate to be described later. The freestanding structure for theUnited States is numbered 057 on that template. Referring to FIGS.10A-10B, the freestanding shelf structure for each Nation has horizontalshelves from floor to ceiling 10,000 feet long and 1000 feet highdivided vertically into 999 segments 10 feet wide and 100 feet deep eachnamed for a State or Province within the Nation. The alphabeticallylisted state or province names are numbered to correspond to column 2 ofthe Location Template. This provides for ultimately storing data for upto 999 different industries in each nation location storage unit. Thesesegments are further subdivided to accommodate 999 smaller segments eachnamed for the major cities within the states or provinces andrepresenting column 3 of the location template. Each of these smallersegments is further subdivided into 100 discrete locations within, ornear the major cities and are represented by column 4 of the locationtemplate FIG. 2. These can be made as specific as desired by adding morecolumns to the template. All freestanding shelf structures areidentically configured as to storage locations. They differ only as tosize. Obviously the storage capacity of the shelf structures forAfghanistan and Zaire would be less than those for the US, the UnitedKingdom, Japan and Germany etc.

[0059] A look at the configuration of the location template, FIG. 2,will be the same as an overhead view of the divisions of the individualshelves of each freestanding storage unit. The above hypotheticalwarehouse obviously is not a practical building installation, but doessuggest a huge matrix containing thousands of discrete addresses capableof containing millions of electronically stored documents. Referring toFIGS. 6A-6B, the matrix is a huge data storage warehouse, FIGS. 10A-10B,that is divided by Nation, States or Provinces within the Nation, majorcities within that state or province and specific communities orlocations associated with the major city, each identified by a discretenumber. These numbers are part of a “header” for each stored document toidentify exactly where that document is stored in the Supermine™warehouse FIGS. 10A-10B. Each of these Location Storage segments houseall available data relative to the many industries that make up theeconomy in the Nation represented by the Warehouse storage location. Forexample, as shown on FIGS. 6A-6B, there is one warehouse storagelocation for Australia, one for Brazil, one for Canada, one for theUnited States etc. The warehouse storage location for the United Statescarries the assignment 057 because that is the scan line of FIG. 2 thatcontains the United States. This system is designed to be expandable upto 999 separate locations.

[0060] Still referring to FIGS. 6A-6B, observe that the industry storagepositions (16) are contained within those Nation, State, City andspecific warehouse storage spaces (15). There is assigned an industrystorage number 001 for Accounting/Bookkeeping, 002 for Agriculture, 006contains all available data relative to the Oil and Gas industries andnumber 999 contains all available data on the field of Zoology. Alldocuments that contain data relative to any industry or field of studyrelating to Australia or those that originated in Australia are storedin location 003. All documents to go into the Australia storage locationcarry the prefix “003”. All documents relating to the United States willcarry the prefix “057” All documents that relate to the Nation Zairewill carry the prefix “399”. Any Industry document relating to the Oiland Gas industries in the United States will carry the prefix 057006. Ifthat same Industry document related to drilling for oil in Zaire, itwould carry the header numerals of 399006. All warehouse storagelocations identify where the industry documents are stored. There can bea full range of industry documents stored in each of the warehousestorage locations.

[0061] There can be ninety-nine Industry data storage folders,expandable to 999, stored in each State, City or Specific Locationwarehouse storage position. There will be a data storage folder for theAccounting/Bookkeeping industry numbered 001. There will be a datastorage folder for the Agriculture industry numbered 002. The Oil andGas industry is numbered 006 with Zoology numbered 099. The IndustryTemplate, FIG. 3, corresponds to an overhead view of how the Industryregisters are configured within the location templates. Column 1 is alisting of the type industry, i.e. Accounting, Agriculture, Oil and Gas,Transportation, Taxes etc. There is an Industry template for each ofthese industries, and all are configured the same. To fully understandhow these objectives are achieved it is necessary to leave FIGS. 10A-10Bfor a time and refer to the figures explaining construction and use ofthe Supermine™ templates. Referring to FIG. 2, the numbering scheme forthe location templates is used as the configuration pattern for theSupermine™ document storage warehouse that provides a discrete storageaddress for any Industry document retained in the warehouse.

[0062] The disclosed system is designed to acquire data from anywhere inthe world and process it to make it available, as desired, to anywherein the world. Obviously any system with these objectives must begin witha means of identifying the location where the document originated, howit pertains to that location, the application of the field of study tothat specific location and the language used in the document. FIG. 2teaches a template listing many locations around the world that arelikely to need a bi-lateral exchange of data. Obviously this numberingsystem could be in machine language, as is done further into thecategorizing process. For purposes of illustration, the Nations incolumn 1 are listed alphabetically from Afghanistan to Zaire withAfghanistan being assigned the number 001 and Zaire being assigned thenumber 399. This means that the Supermine™ System is equipped to takedata from, and transmit to, 399 different Nations. In actual practice,the literal nation location template is selected and processed in theorder of the most frequent usage. As previously mentioned, the numbersderived from the industry template will become part of the header forany document originating or relating to a particular Nation. Inaddition, the analyst numbers each industry template as it completed.That template number will be included in the header for all documentsselected for storage using that particular industry template. Theindividual template number is one component of the “header” that formsthe basis for the Supermine™ inventory and document retrieval system.

[0063] Referring to FIG. 2, Column 1, for this disclosure Australia willcarry the number 003 in its header, Canada will be number 006 and theUnited States will carry the prefix 057 on all documents originating in,or relating to the United States or to an industry located in the UnitedStates. For purposes of clarity, column 1 of the template is called theprimary location of the document. Supermine™ needs to know, within aNation, what state or territory the document pertains to. Column 2 isthe secondary location. For the United States, all 50 states aresecondary locations. Therefore the State of Alabama carries the prefixof 057001. The 057 is for the United States, and the 001 is for theState of Alabama. In like manner the State of Texas would carry theprefix 057043 telling us that Texas is the 43^(rd) state listedalphabetically. Now we need to know to what city or large area withinthe State or Province the document relates.

[0064] Referring to FIG. 2, a few of the cities within Texas are listedto illustrate the Location Template. Houston is the fourth city incolumn 3. This list could be 999 cities as needed. Therefore the city ofHouston, in the State of Texas in the United States would carry thenumerals 057043004. To carry the example further, Pasadena, Tex., as asuburb of Houston, would be part of a list of locations near the largercity of Houston. If we assume that Pasadena was the first locationlisted as a suburb of Houston, it would carry the numerals 05704300401in the header of any document relating to Pasadena, Tex. The zeros tothe left of the significant numbers are to accommodate growth withoutredesigning the thousands of digit registers used in Supermine™.Referring briefly to FIGS. 10A-10B, notice the numbers on the LocationStorage assignments (15). The storage location for the United Statescarries the numeral 057. Any document referring to Pasadena, Tex. wouldbe found on the Texas scan line 043 in that storage location.

[0065] Now referring again to FIG. 2, column 1 lists the Nations inalphabetical order. Notice that the United States is found on scan line57=057. Now look at column 2. Starting on the column 1 scan line 57 forthe US, the States are listed in column 2 in alphabetical order. Forclarity, the scan lines for columns 3 through 5 are not shown. However,Alabama begins at the US scan line 57 of column 1 and begins a scan linefor column 2. Texas is found on column 2 scan line 43. That identifiesTexas as 043. Begin at the Texas scan line 043 and move across intocolumn 3. Notice that the Texas cities begin on that scan line and arelisted alphabetically. Notice that Houston is the fourth city listed,thus the number 004. The same procedure follows in the other columns.There is a listing of all Nations, States within the Nations, all majorCities within those States, Provinces or Territories, and the largerlocations or specific sites near, or in, those cities. Obviously thisnumbering system could identify more specific locations by simply addingadditional numbers to the templates. The Supermine™ location listing isvery large and requires huge amounts of computer memory. For thisreason, the numbering and document assignment system will be codecompressed or compandored to reduce the space required. There are manymethods of encoding or compressing the bit stream such as Binary CodedDecimal, Packet Switching Techniques, Code Compression etc. However, theprinciple remains the same.

[0066] In the illustration for this disclosure there are individualtemplates for all 399 Nations from Afghanistan to Zaire, allcorresponding to 399 Storage locations. FIGS. 10A-10B, (15). In theevent the storage location for larger nations such as the United States,The United Kingdom or Japan becomes two thirds full, provisions can bemade to program the Template to accept an auxiliary location numbering.This scheme can apply to all Location Storage positions, FIGS. 10A-10B,(15) or to the Industry folder assignments FIGS. 10A-10B, (16). Thesestorage compartments are actually huge folders in a large computer. Aseries of folders containing the data for the industry or fields ofstudy resides in every Location Storage Unit, (FIGS. 10A-10B, 15). It ishere that the position of each number in the numbering scheme becomesimportant. For that reason the first thirteen digits plus one space ofall document assignments are reserved for location digits.

[0067] For example: Corresponds to Template: Numeral Position 1 0 XNation Id. Column 1 X Numeral Position 2 0 X Identifies the State orColumn 2 X Province within a Nation Numeral Position 3 0 X Identifiesthe City or Column 3 X area within a State X or province NumeralPosition 4 0 X Identifies specific Columns 4-“n” = locations nearcities. Small locations

[0068] Each column of both the location templates and the industrytemplates has its own scan lines. Referring to FIG. 2, note that column1 shows scan lines numbered from 1 through 399. The purpose is to enablethe scanner to “remember” where a particular nation was found in thescanning process and begin at that point for future scans rather thanbeginning at the top of the scan at Afghanistan on every scan. In likemanner, (not shown for clarity) column 2 has scan lines from one throughfifty (one line per state) for the same reason. When the State of Texaswas found on scan line 43, the scanner could “remember” that and beginscanning for Texas cities at Column 2 scan line 43 rather than scanningdown from Alabama and progressing through all other intervening statesbefore getting to Texas. In like manner, Column 3 has its own scan linesas do the other columns. The same concept is used in FIG. 3. Thisnumbering scheme is also shown graphically on FIG. 4. This completes thebasic numbering scheme for the Location Template (FIG. 2).

[0069] Referring to FIG. 3, as explained below, the storage locationscarry the same numbering scheme as the template columns of the Industryor field of knowledge Template. In the agricultural example, only onesegment of the total industry is shown, however, the format for othersegments and other industries is the same and a similar template isconstructed for each of them. Because of the worldwide operation ofSupermine™, industry templates carry the prevalent language of theNation relevant to the content or origin of incoming documents. As isthe case for the Location Template (FIG. 2) the Industrial Template(FIG. 3) will be the Industry numbering scheme defining the discreteassignment of each document within the warehouse storage location.Documents will be stored in the Industry Data Storage folders (FIGS.10A-10B, 16) that reside in the Nation storage units FIGS. 10A-10B,(15).

[0070] Refer briefly to FIGS. 6A-6B (16). Notice that there is a storagefolder for Alabama, Number 001. There is a folder for the Transportationindustry (036) in Alabama. There is a folder for Wyoming, 050, thatincludes a folder for the Oil and Gas Industries (006) in Wyoming. Thereis a folder for Agriculture (002) in Texas. There is a separate Templatefor each of these industries and in addition; each industry template isduplicated in the most prevalent language of all Nations represented inthe storage Warehouse. The Analysts who prepares the templatesdetermines the specificity of detail stored for these industries by hisuse of the columns of the template. He also adds a discrete number toevery template as it is completed.

[0071] The above information can be understood by referring to FIG. 3.Column 1 refers to the digit position in the broad (or primary) industrycategory. In this particular case the agriculture industry was used asan example. The primary industry category will be recorded in the column1 position the secondary in the column 2 position etc. Referringtemporarily to FIGS. 10A-10B, (16), note that the warehouse storagelocation for agriculture is numbered “002”. All documents relating toagriculture carry the primary number of “002” and are always in thecolumn 1 position regardless of their warehouse location FIGS. 10A-10B,(15). There are agriculture documents (002) in the location storageposition for Kentucky, (017) and agriculture documents (002) in thestorage location position for Arkansas, (004). Since there are manysynonyms for agriculture, all commonly used terms for the agriculturalindustry are listed under column 1A on the agricultural template. Forexample, a document that discusses hydroponics will be listed in column1A and will be numbered (05). A document discussing crops will benumbered (06). A document discussing the harvest will be numbered (08)etc.

[0072] Industry Template Column 2 lists the specific aspect of theagricultural industry being discussed in a document. Assume that the385^(th) document received at Supermine™ discusses the String Bean cropin Pasadena, Tex.

[0073] The location header will be numbered:

[0074]057 designating the United States

[0075]043 designating the State of Texas

[0076]004 designating the City of Houston, Tex.

[0077]01 designating the community of Pasadena, Tex.

[0078] The numbers 05704300401 would tell the computer to store theincoming document in Location 057 at the addresses 04300401, which isassigned to Houston, Tex. and the suburb of Pasadena, Tex. When thewords “crops” or “harvest” or similar words are detected by the DataReceiver/Key Word Comparitor, FIGS. 13A-13C, (21) the comparitor putsthe numerals 00206 in the header for the document thus designating thatthe document refers to the Agriculture industry, that it will be storedin Industry folder 002 and that it will be further located in the“crops” location numbered “06”. When the words “bean, or beans” aredetected, the comparitor FIGS. 13A-13C, (21) adds the numeral “03” tothe header sequence thus designating that the crop being discussed inthe document refers to the bean crop.

[0079] To carry the example further, when the Data Receiver/Key WordComparitor, FIGS. 13A-13C (21) detects the word “string” or the words“String Beans” the comparitor adds the number 02 to the sequence thusdesignating that the crop in Houston, Tex. that is being discussed inthe document is the string bean crop. This is where the sequence wouldusually end. However, to carry the example further, (still referring toFIG. 3) if the document discussed the suitability of the soil inPasadena, Tex. for planting string beans, the number 04 would be addedto the number sequence probably followed by one or more of the factorsshown in column 5 of FIG. 3. Therefore, an incoming document thatdiscussed how to prepare the soil in Pasadena, Tex. to get the beststring bean crop would be assigned the numbers:

[0080]057004300401 sends the document to warehouse Location 057=the US,043=Texas, 004=Houston, the 01=Pasadena, Tex.

[0081]002=assigns the document to the Agriculture folder within the USstorage (057).

[0082]00206=the topic is “crops”

[0083]0020603=the crop being discussed is “beans”.

[0084]002060302=the type beans is “string beans”

[0085]00206030204=the topic relating to string beans is how to plantthem.

[0086]0020603020401=how to prepare the soil in Pasadena, Tex. forplanting string beans.

[0087] Therefore, a document that discussed planting string beans inPasadena, Tex. would carry the header: 0570430040100/0020603020401/. Ifthe document is selected for storage, the Supermine™ document number andthe template number will also be included in the header. There is alocation template for all locations for which data is collected andstored. These are “one time” templates that change only when locationsare deleted or added within Supermine™. The industry, or fields of studytemplates are “living documents” that are updated periodically to keepcurrent with progress and innovations within an industry or field ofstudy. The quality of the industry templates will determine the qualityof the Supermine™ product. For that reason, they are proprietary.Security measures are provided by this invention to protect theseintellectual properties. Refer to FIGS. 1A-1C, 7, and 5A-5B.

[0088] The invention, and these templates are not limited or restrictedas to content or utilization of the storage locations or the industryfolders. Only the depth of knowledge and ingenuity of the analyst whoprepares the template limits the quality, variety and level of detailstored in Supermine™. Templates can be prepared to range from a broad,generic type to one identifying or describing a fine grain, highlydetailed work operation. Provision is made for controlling the degree ofdetail collected before a document is sent to warehouse storage. Addingor reducing the number of columns on the template to “match” the levelof detail wanted does this. The more columns, the greater the detail.Fewer columns used result in more documents collected covering greaterscope of the industry. Referring to our previous example, if onlycolumns one and two were to be used on the polling search template, allagricultural documents that mentioned variety of beans would be acceptedinto the comparitor. Since our analyst needed only the data relating toplanting string beans in a specific location, the agriculture templatecopied into the Key Word Comparitor Register FIGS. 13A-13C, (21)contained a full set of numbers in all columns. This prevented storingany documents except those referring to planting string beans inPasadena, Tex.

[0089] The industrial world has the potential for exponential growth.National boundaries are virtually disappearing where internationalcommerce is concerned. The language barrier is a huge impediment to thisgrowth. Documents arrive at Supermine™ using many different languages.Using humans to translate them is very expensive and time consuming.Having humans translate documents is not compatible with Supermine™ massdata acquisition and distribution operation. Supermine™ solves thisproblem by collecting and disbursing data from and to multiple sourcesworldwide at very high speeds without using humans to read and makedecisions relative to language or translating documents. LocationTemplates (FIG. 2) and Industry Templates (FIG. 3) have been designed toovercome much of the language barrier. A template can be selected fromSupermine™ that conforms to the language used in the incoming document.The location and language of the incoming document is immediatelyidentified and a Location Template (FIG. 2) associated with thepertinent Industry Template (FIG. 3). This action is completed in theshort interval between identifying the location and language thatpertains to an incoming document and the beginning of the scans matchingkey words of the Industry Template (FIG. 3) with words in the incomingdocument.

[0090] Supermine™ accomplishes these functions with a unique Locationand Industry Key Word Template Storage Unit FIGS. 5A-5B that overcomesmuch of the language barrier and greatly reduces document transit time.Referring to FIGS. 5A-5B, the Location and Industry Key Word TemplateStorage Unit FIGS. 5A-5B (8) is a three-section matrix that containsevery template used in Supermine™. The first section contains thepermanent copy of all Location Templates listed in the order of thegreatest activity for the location of origin for incoming documents. Thefirst, number 057, represents the United States, number 051 representsthe United Kingdom; number 011 is for Japan etc. These numberscorrespond to the warehouse storage locations of FIGS. 10A-10B and6A-6B, (15) as previously explained. All assignment and storage is doneelectronically.

[0091] The nation names shown opposite the nation numbers and theindustry names opposite the industry numbers in FIGS. 5A-5B are alsoused in the document retrieval process and will be explained later. Thesecond section of the matrix is arrayed in columns; one column for eachlanguage to be accommodated by Supermine™. Column 01 of the matrixcontains an English language version of every Industry Template (FIG. 3)used in Supermine™. Column 02 is a Spanish version of every IndustryTemplate used in Supermine. The same templates are listed in column 03,with the exception that they are in the Japanese language. For clarityin this disclosure, the example shows that there can be 999 differentIndustry Templates (FIG. 3) in each of 1 through “N” columns dependingon the number of languages to be accepted in Supermine™. All 999templates will be identical except for language.

[0092] The third segment of the Industry side of the Location andIndustry Key Word Template Storage Unit FIGS. 5A-5B (8) is actually alarge register that contains the electronic equivalent of the numeralsof every National location served by Supermine™. The register isdesigned to conform to the column numbers of the language section. Forexample, the first segment of the register corresponds to the Englishcolumn 01. The location numerals for every English speaking nation(location) are inserted in the register in the English-speaking segment,i.e. column 1. The second segment, column 02, lists the same IndustryTemplates (FIG. 3) in the Spanish language and the location numerals forall Spanish-speaking nations (locations) are inserted in column 02, etc.The location numerals for all locations that use the Urdu language areinserted in column “n”. The operation and use of this matrix will becomeapparent as the description progresses.

[0093] The method of polling and receiving data from many sources andgetting those data into a Data Receiver/Key Word Comparitor FIGS.13A-13C (11) has been described. In addition, the design of the locationtemplates, FIG. 2, the industry templates FIG. 3, and the basicconfiguration of the Location and Industry Key Word Storage Unit FIGS.5A-5B were explained. The configuration and operation of the warehouselocation assignments were further explained on FIGS. 10A-10B, (15). TheData Storage folder configurations for each industry, FIGS. 10A-10B,(16) have been explained. Notice that each location and industrytemplate is numbered within the Location and Industry Template StorageUnit FIGS. 5A-5B, (8) and that these numbers correspond to the warehouselocation assignment of FIGS. 10A-10B, (15). The example used in thedisclosure shows that there could be as many as three hundred ninetynine location templates with a corresponding warehouse storage locationFIGS. 10A-10B, (15) for each Location Template.

[0094] The first location template listed on FIGS. 5A-5B (8) is number057 for the United States. This is because the warehouse storagelocation, FIGS. 10A-10B, (15) for the US is the most heavily used. Thesecond location template listed is for the United Kingdom and is thesecond most frequently used location. This order follows through alllocation template assignments in the order of usage. The purpose of thisis to reduce scanning time during location template selection. It wouldbe inefficient to scan through Afghanistan before scanning the moreheavily used locations because the probability of a document referringto that location is much lower than industrialized locations such as theUnited States, The United Kingdom, Canada, Australia, Germany, Japan,etc.

[0095] The templates stored in the Location and Industry Key WordTemplate Storage Unit FIGS. 5A-5B (8) are stored in machine language.Since all “reading” is done electronically the plain language names forlocations, industries and industry functions must be converted intonumerals to store and retrieve documents from the warehouse Nationlocations, FIGS. 10A-10B, (15). (See client data request answers on FIG.7A). The Location and Industry Template Storage unit FIGS. 5A-5B, (8) isconfigured and programmed to function as a conversion look-up table toaccomplish this purpose.

[0096] Referring still to FIGS. 5A-5B, observe that the location sectioncarries both the location name and the corresponding numeralidentification. In like manner the industry section carries both thename of the industry and the numeral equivalent. These names and numbersare electronically linked in the program so that when a scan search ismade using the plain language name, the result is a reply to the pollingunit using the numeral equivalent of the name. For example, if a scansearch was made using the words “US” or “United States”, the scannerwould send back the numerals “057”. Conversely if the scan were madeusing the numerals “057” the resulting reply would be “US” or “UnitedStates”. In like manner if a scan were to be made using an industry namesuch as “Agriculture” the scanner would send back the numerals “002” andvice-versa. The scanners are under the control of the Template ControlUnit (17) of FIGS. 1A-1C. If the volume of traffic between a non-Englishspeaking Nation and the United States becomes so great as to become aproblem, a language conversion process could be inserted between theProtocol Converters, FIGS. 1A-1C, (6) and the Security Screening Unit,FIGS. 1A-1C, (7) to facilitate communication with the analyst.

[0097] When the scan begins, the numerals 057 in location slots one, twoand three, (the primary location slots) are the only numerals availablein the Receiver/Key Word Comparitor FIGS. 13A-13C (21) register forcomparison with the incoming document. All secondary and other slots areclosed at this time. If there are no nation words in the documentreferring to the United States, the Receiver/Key Word Comparitor FIGS.13A-13C (21) “steps” over to column 2 (the secondary location slots) ofthe location template and begins a scan for any state names in the US.If none are found the Comparitor then scans for city words in columnthree. After the third scan, a “no match” signal is sent to theReceiver/Register sensing Buffer FIGS. 13A-13C (19) which sends acommand to the Template Control Unit FIGS. 1A-1C (17) to delete thecontents of the Receiver/Key Word Comparitor FIGS. 13A-13C (21) registerand re-enter the Location And Industry Key Word Template Storage UnitFIGS. 5A-5B (8) at position two of the Location Template listings whichis for the United Kingdom. The above process is repeated until a Nationlocation is detected and the scan is continued as described later. Inthe event there is no indication of the nation location or any state ormajor city that would indicate a location, the following default actiontakes place.

[0098] The Location Template Transport Control FIGS. 1A-1C (20) and theIndustry Template Transport Control FIGS. 1A-1C (25) have an internalcounter that “makes busy” the templates selected to prevent selectingthem a second time in another scan until all other templates have beenscanned. It then “steps” to the next template number. This innovationassures that all templates are selected and scanned for that documentbefore repeating a template selection. That selection sequence isretained as long as the same document remains in the Receiver/IncomingDocument Register FIGS. 13A-13C (18).

[0099] When the Template Transport Control FIGS. 1A-1C (17) counterreaches 399, indicating that all templates have been scanned with nodocument location found, the Receiver/Key Word Comparitor FIGS. 13A-13C(21) register sends a “no match” signal to the Receiver/Register SensingBuffer FIGS. 13A-13C (19) which commands the Template Control FIGS.1A-1C (17) to delete the last location template in the Receiver/Key WordComparitor FIGS. 13A-13C (21) and to again enter the Location andLocation And Industry Key Word Template Storage Unit FIGS. 5A-5B (8) atposition one of the Location Template listings which is for the UnitedStates. The Template Control FIGS. 1A-1C (17) checks that the templateselection counter is at 399 template scans, and enters the templatestorage to select the United States as the default location. It againsends the numerals 057 into the Receiver/Key Word Comparitor RegisterFIGS. 13A-13C (21) immediately followed by the entire 057 locationtemplate. This template remains in the comparitor until the incomingdocument location scanning and comparing is complete. When the Templatecontrol Unit FIGS. 1A-1C (17) is commanded by the Receiver/RegisterSensing Buffer FIGS. 13A-13C (19) to clear the incoming document fromthe Receiver/Incoming Document Register FIGS. 13A-13C (18), it alsoresets the counters in both the Location and Industry Transport ControlsFIGS. 1A-1C (20, 25).

[0100] Assuming that we now have a legitimate location template storedin the Receiver/Key Word Comparitor register FIGS. 13A-13C (21), thefollowing action continues. When the Receiver/Register Sensing bufferFIGS. 13A-13C, (19) receives the signal that the Location template isstored in the Receiver/Key Word Comparitor FIGS. 13A-13C, (21) it isprogrammed to close all register slots in the Receiver/Comparitor MatchBuffers FIGS. 13A-13C (21 and 22) except those reserved for theSupermine™ document number and the date the document was received. Whenthe Receiver/Register Sensing Buffer FIGS. 13A-13C, (19) senses thatthese slots are filled, it re-opens all Location and Industry slots inthe register.

[0101] When the complete location template has been received, thecomparison of key words in the location template with any key words inthe document begins as the template is “read” into the Receiver/Key WordComparitor Register FIGS. 13A-13C (21) The first words in any incomingdocument are the Supermine™ number and the date received. All registersare programmed to give these numerals an automatic “match” and to insertthem into the Receiver/Comparitor Match Buffers FIGS. 1A-1C (22,23) aspart of the header for the document being scanned. All numerals areinserted into the Receiver/Key Word Comparitor Match Buffers FIGS.13A-13C (22,23) in specific positions to prevent errors or duplications.For a better understanding of this technique, refer to the configurationof the Receiver/Comparitor Match Buffers (22-“n”) of FIG. 4.

[0102] Refer to FIG. 4 for a graphic explanation of the header. Fromleft to right, the first thirteen-register spaces plus one separatorsymbol (/) space are dedicated to identifying the location informationof the document. This will be the document assignment into theSupermine™ warehouse.

[0103] The next thirteen-register positions plus one space symbol (/)are reserved for designating the Industry, functions and processes thatoccur within that industry. These will determine the Industry assignmentfor the document within the Location assignment. The next tenregister-spaces plus one space symbol (/) are reserved for recording theSupermine™ document number. This number is put on the document when itis received by the Data Receiver Transmitter FIGS. 1A-1C (9). The nextten-register spaces plus one space symbol (/) are reserved for thenumber of the template actually used to identify the document ultimatelyselected. This number, in connection with the Supermine™ number on thedocument, forms the basis of a method of document retrieval and aninventory system. It is placed in the document header after it isdetermined that the document will be selected and stored in thewarehouse. The next nine-register positions are reserved for the datethat the document was received with the remaining 5 spaces in theregister containing zeros to indicate unused spare.

[0104] The date received, and the Supermine™ document number areinserted into the header of every incoming document by the DataReceiver/Transmitter FIGS. 1A-1C (9) and are placed at the top of everydocument. The Receiver/Register Sensing Buffer FIGS. 13A-13C, (19)controls the opening and closing of register slots to assure getting theincoming data into the proper Comparitor Match Buffer slots. These dataare always automatically placed in the same slots in theReceiver/Comparitor “Match” Buffer FIGS. 13A-13C (22) and in theReceiver/Comparitor Match Buffer FIGS. 13A-13C (23) for use in theDocument Review File FIGS. 1A-1C, (38).

[0105] To accommodate growth without re-programming all registers, allnumeral slots are filled from right to left with the least significantnumber entered first and the most significant number entered last. Theremaining header numeral slots are filled with zeros by the DataReceiver/Transmitter FIGS. 1A-1C (9) as it makes the header assignment.This prevents insertion errors and makes it easy for theReceiver/Register Sensing Buffer FIGS. 13A-13C (19) to sense that theheader numeral and date slots are filled. This also makes it possiblefor the Receiver/Key Word Comparitor FIGS. 13A-13C (21) to automaticallycopy these numerals into the proper slots in the Receiver/ComparitorMatch Buffers FIGS. 13A-13C (22-“n”). This is accomplished byprogramming the Receiver/Register Sensing Buffer FIGS. 13A-13C (19) toopen and close the Receiver/Comparitor Match Buffer FIGS. 13A-13C(22,23) numeral slots in a controlled sequence to assure the desirednumerals are inserted in the register at the proper time and in theproper sequence.

[0106] For example, all Receiver/Comparitor Match Buffer FIGS. 13A-13C(22) register numeral slots are normally closed. When an incomingdocument has been received into the Incoming Document Register FIGS.13A-13C (18), a “document complete” signal is sent from the IncomingDocument Register FIGS. 13A-13C (18) to the Receiver/Register SensingBuffer FIGS. 13A-13C (19. This causes the Receiver/Register SensingBuffer FIGS. 13A-13C (19) to activate the Template Control Unit FIGS.1A-1C (17) and at the same time, open the three numeral slots in theComparitor Match Buffers FIGS. 13A-13C (22,23) that are reserved for thenation location digits. It also opens the correct register slots for theSupermine™ document numbers, and the date slots. A shown in FIG. 4, ifthe document header number happened to be 385, the ten slots wouldcontain the numerals and symbol 0000000385/. When the Receiver/RegisterSensing Buffer FIGS. 13A-13C (19) senses that all eleven digits havebeen received, it opens the nine spaces in the register of theReceiver/Comparitor Match Buffer FIGS. 13A-13C (22,23) that are reservedfor the date the document was received. If the document was receivedNov. 28, 2000 the numerals inserted in the date slots of the registerwould look like this: Nov. 28, 2000/. At this time in the documentscanning process the Comparitor Match Buffer FIGS. 13A-13C (22), alsosee FIG. 4, would look like this:0570000000000/0000000000000/0000000385/0000000521/11/28/00/00000

[0107] All other register slots remain closed until theReceiver/Register Sensing Buffer FIGS. 13A-13C (19) detects that the“nation” location slots, the Supermine™ number and date slots arefilled, it then closes these register slots and opens the ten slots forthe State, City, Community etc. plus one space slot. When the abovesequence is complete, the Receiver/Key Word Comparitor Register (21)begins the search for words in the document stored in theReceiver/Incoming Document register FIGS. 13A-13C (18) that “match”words used in the template. This operation is very similar to the “Find”command of PC operation with the exception that the comparitor isprogrammed to “look” at the first word of column 2 of the LocationTemplate that is in the Receiver/Key Word Comparitor Register FIGS.13A-13C (21) then search through the bits that are stored in theIncoming Document Register FIGS. 13A-13C (18) for a word that matchesthe first word of the location template.

[0108] For example, refer to FIG. 2. The first location templateselected for scan and copied into the Receiver/Key Word ComparitorRegister FIGS. 13A-13C (21) is number fifty-seven that is the assignednumber (057) for the United States. The Receiver/Comparitor Matchbuffers FIGS. 13A-13C (22,23) already contain the numerals 057 in thecolumn 1 (nation) slot of the register. Therefore, the scan actuallybegins in column 2 for the individual states in the US. The first wordof column 2 of FIG. 2 is “Alabama”. The comparitor searches the incomingdocument for the word “Alabama” and not finding a match, then “looks” atthe second word of the template that is “Alaska”. Not finding a match,the search continues until the comparitor makes the 43^(rd) attempt andfinds a match with the State of Texas. The comparitor “marks” scan lineforty-three (43) where Texas was found and inserts the numeralsforty-three (043) into the column 2 slot of the Receiver ComparitorMatch Buffers FIGS. 13A-13C (22,23) and shifts to column 3 of FIG. 2.

[0109] To prevent having the comparitor scan and search through allcities in all states beginning with Alabama until it gets down to Texas,the comparitor is programmed to begin scanning column 3 at the scan linewhere it found the State of Texas, which was scan line 43 of thelocation template. Starting on scan line 43, Column 3 of the locationtemplate, the first city in Texas on the template is “Austin,” no matchis found. The cities of Dallas, and Fort Worth are scanned but no matchwas found. On the next scan, the comparitor found the word “Houston” inthe document. It immediately inserted the numerals “004” in the column 3register slots of the Receiver/Comparitor Match Buffers FIGS. 13A-13C(22,23) following the forty-three (043) for the State of Texas. On thecolumn 4 scan, the comparitor found a match for the community ofPasadena, Tex., which is a close suburb of Houston. When the match wasfound, the comparitor inserted the numeral one (01) into the column 4slot of the Receiver/Comparitor Match Buffers FIGS. 13A-13C (22,23).When the comparitor “stepped” to column five (5) it found that there wasnothing entered in this column on this particular template. Thecomparitor inserted zeros (00) in the column five (5) slot of theReceiver/Comparitor Match Buffers (22,23) and signaled “location scancomplete” to the Receiver/Register Sensing Buffer FIG. 13A (19). This isa command for the Receiver/Register Sensing Buffer FIGS. 13A-13C (19) to“look” into the location slots of the Receiver/Comparitor Match BuffersFIGS. 13A-13C FIG. 13 (22 and 23) to determine whether there are numbersin all location slots in either Receiver/Comparitor Match Buffers FIGS.13A-13C (22 or 23). If so, the Receiver/Register Sensing Buffer FIGS.13A-13C (19) uses that information to know that location scanning iscomplete. It temporarily stores the “057” location number and at thesame time, it sets a “location scan complete bit” and activates thesensing unit to send a signal to the Template Control Unit FIGS. 1A-1C(17) to delete the copy of the location template in the Receiver/KeyWord Comparitor FIGS. 13A-13C (21).

[0110] The following describes the process of determining the languageused in the incoming document. The Receiver/Register Sensing BufferFIGS. 13A-13C (19) is programmed to sense the empty register in theReceiver/Key Word Comparitor FIGS. 13A-13C (21) and send a command tothe Template Control FIGS. 1A-1C (17) to activate the Industry TemplateTransport Control FIGS. 1A-1C (25). It also sends the Location numbers“057” to the Template Control Unit FIGS. 1A-1C (17) causing it tocommand the Industry Template Transport Control FIGS. 1A-1C (25) toenter the Location and Industry Template Storage Unit FIGS. 5A-5B at theLanguage Column 1 and scan through the Language Column Register (31)looking for the Location numbers “057”. For the purposes of theexplanation, it finds the numerals “057” in Language Column 1, FIGS.5A-5B, (30), the English Language version of all Industry Templates. TheIndustry Template Transport Control FIGS. 1A-1C (25) sets the templatecounter to prevent selecting that template again until all others havebeen scanned. It then copies the first template listed in column 1 intothe Receiver/Key Word Comparitor Register FIGS. 13A-13C (21) where ascan search is made looking for any matching Key words. For purposes ofthis explanation, none were found on scan line 001 (Accounting). A “scancomplete” signal was sent from the Receiver/Key Word Comparitor FIGS.13A-13C (21) to the Receiver/Register Sensing Buffer FIGS. 13A-13C (19)that is programmed to “look” into the Receiver/Key Word ComparitorRegister FIGS. 13A-13C (21) to determine the condition of the register.It finds the Location template slots filled, but since it has set a“scan complete” bit from the Receiver/Key Word Comparitor Register FIGS.13A-13C (21) it again stores the Location numbers “057”, sets an“industry scan complete” bit and signals the Template Control Unit FIGS.1A-1C Figure (17) to delete the content of the Receiver/Key WordComparitor Register FIGS. 13A-13C (21). Again, the Receiver/RegisterSensing Buffer FIGS. 13A-13C (19) senses the empty Receiver/Key WordComparitor Register FIGS. 13A-13C (21) and sends the Location Numbers“057” along with a command to the Template Control FIGS. 1A-1C (17) toactivate the Industry Template Transport Control FIGS. 1A-1C (25). Thiscauses the Industry Template Transport Control to enter the Location andIndustry Template Storage Unit FIGS. 5A-5B at the language columnregister FIGS. 5A-5B, (31) and search for a match with the numbers“057”. A match is found at scan line 2 of the Language Column 1 which isfor the English language.

[0111] Because Industry Template “001” (Accounting) had already beenselected, rejected and made unavailable to the next scan, Templatenumber “002” for Agriculture is the next template selection. Thiscorresponds to the number of the industry warehouse storage location foragriculture. The Industry Template Transport Control FIGS. 1A-1C (25) isprogrammed to immediately insert the number “002” into the Industrynumeral slots, the only slots open at this time, in theReceiver/Comparitor Match Buffers FIGS. 13A-13C (22/23) and then copythe content of Industry Template 002 into the Receiver/Key WordComparitor FIGS. 13A-13C (21) that is a component of Data Receiver/KeyWord Comparitor No. 1 FIGS. 13A-13C (11).

[0112] When the industry template has been received, the comparison ofkey words in the Industry template with key words in the document beginsas the template is “read” into the Key Word/Comparitor Register FIGS.13A-13C (21). In this case, when the Receiver/Key Word Comparitor FIGS.13A-13C (21) attempts to insert the document header into the ComparitorMatch Buffers FIGS. 13A-13C (22/23) it finds these slots already filledwith the exception of the slots for the number of the template that areopen but empty at this time. It is programmed to “re-scan” when thiscondition occurs. It re-scans the header positions in the incomingdocument, and inserts the number of the template being used in thetemplate number slots into the Key Word/Comparitor register FIGS.13A-13C (21) and proceeds to the document content scan.

[0113] Referring to FIG. 3, the Agriculture designation 002 has alreadybeen inserted in the Comparitor Match Buffers FIGS. 13A-13C (22,23). TheReceiver/Key Word Comparitor FIGS. 13A-13C (21) begins the scan incolumn 1 of the template by searching the document for the word“farming”. No such word is found. It then “looks” back at the templateand begins a search for the word “agronomy,” and again no such word isfound. This process continues until the sixth scan where the word“crops” is found. It “marks” the scan line (6) and inserts the numbersix (06) into the Receiver/Comparitor Buffers FIGS. 13A-13C (22,23) atthe Industry column 1A slots in the register.

[0114] It should be noted at this point in the operation that theReceiver/Register Sensing Buffer FIGS. 13A-13C, (19) closes the column1A slots in the register and opens the column 2 slots. This action takesplace progressively on every scan until the register is filled.

[0115] The Comparitor then begins on scan line six of column 2 andsearches through the words “wheat” and “corn” with no match found. Onthe third scan, a match for the word “beans” is found. The Comparitorthen marks scan line three and inserts the number three (03) into theComparitor Match Buffers FIGS. 13A-13C (22,23) at the column 2 slots ofthe register, closes those column 2 slots and opens the column 3 slots.

[0116] The Comparitor then “steps” to template column 3 and beginssearching at scan line three for column three matching words. Itsearches for “soy beans” and finds no match. It then searches for thecommonly paired words “string beans” and finds a match. It “marks” scanline two and inserts the number two (02) into the Comparitor MatchBuffers (22,23) at the column 3 slots of the register. This process isrepeated for columns four (4) and five (5). The number four (04) isinserted in the register slot for column 4 and the number one (01) isinserted into the register slot for column 5. The Comparitor has nowinserted the Figures 0020603020401/in the industry slots in the ReceiverComparitor Buffer FIGS. 13A-13C (22).

[0117] From left to right in the buffer the numbers are inserted asfollows: Nation location Id. 057 = The location for the US Statelocation Id. 043 = The location for the State of Texas City location Id.004 = The location for the City of Houston Community location Id.  01 =The location for the Community of Pasadena Column 5 = Vacant =  00 Theindustry category 002 = The industry discussed (agriculture) Theindustry application 006 = The application within the industry —CropsThe specific of the function  03 = The Bean Crop The specific crop  02 =The “String Bean” crop. The “what about it”  04 = The type soil requiredfor string beans More “what about it”  01 = How to prepare the soil forplanting string beans Supermine ™ number 0000000385/ Template number0000000521/ Date Received 11/28/00/ = November 28, 2000 Summary =0570430040100/0020603020401/0000000385/0000000521/11/28/00/00000

[0118] These fifty nine (59) digits and symbols represent receiving adocument on Nov. 28, 2000, numbering it as Supermine™ Document 385,recording the template used to locate the document, 521, learning thelanguage used in the document, English, “reading” it and learning thatit pertains to the type soil and how that soil should be prepared ifsomeone is going to plant string beans in Pasadena Texas in the UnitedStates of America, all without a human having seen the document.

[0119] Refer to FIGS. 13A-13C for the process of storing documents. Whenthe last slot in column 5 is filled, the comparitor signaled “industryscan complete” to the Receiver/Register Sensing Buffer FIGS. 13A-13C(19). This is a command for the Receiver/Register Sensing Buffer FIGS.13A-13C (19) to “look” into the Industry slots of the ReceiverComparitor Match Buffer FIGS. 13A-13C (22) to determine whether thereare numbers in all active register slots. If so, the Receiver/RegisterSensing Buffer FIGS. 13A-13C (19) uses that information to know thatindustry scanning is complete, and that the industry template stored inthe Receiver/Comparitor Register FIGS. 13A-13C (21) can be deleted. Atthe same time, it sets an “industry scan complete” bit and signals theTemplate Control Unit FIGS. 1A-1C (17) to delete the copy of theindustry template in the Receiver/Key Word Comparitor FIGS. 13A-13C(21). The Receiver/Register Sensing Buffer FIGS. 13A-13C (19) isprogrammed to sense the empty register in the Receiver/Key WordComparitor FIGS. 13A-13C (21) and send a command to the Template ControlFIGS. 1A-1C (17) to activate the Match Buffer Transport Control FIGS.1A-1C (26) and command it to copy the content of the Comparitor MatchBuffer FIGS. 13A-13C (22) into the Assignment Number Register, acomponent of the Document Assignment Control buffer FIGS. 6A-6B, (24).This buffer has two registers. The upper register, (24-a) is identicalto the Receiver/Comparitor Match Buffers FIGS. 13A-13C (22,23) (also seeFIG. 4), has the same number of slots for numerals and is dedicated tostoring the content of the Receiver/Comparitor Match Buffer FIGS.13A-13C (22).

[0120] When the Match Buffer Transport Control FIGS. 1A-1C (26)completes copying the data in the Comparitor Match Buffer FIGS. 13A-13C(22) into the Document Assignment Control buffer FIGS. 6A-6B (24) itsends a “task complete” signal to the Receiver/Register Sensing BufferFIGS. 13A-13C (19), which commands the Match Buffer Transport ControlFIGS. 1A-1C (26) to scan the content of the Assignment Number RegisterFIGS. 6A-6B (24-a) a component of FIGS. 6A-6B (24) to verify that thecontent of the Comparitor Match Buffer FIGS. 13A-13C (22) has beencopied with no errors. If no errors are found, the Match bufferTransport Control FIGS. 1A-1C (26) sends a “task complete” signal to theReceiver/Register Sensing Buffer FIGS. 13A-13C (19) that commands theMatch Buffer Transport Control FIGS. 1A-1C (26) to enter the receiverIncoming document Register FIGS. 13A-13C (18) to copy and store the lastten words of that register into a comparitor segment of the Match BufferTransport Unit. When the integrity of the data stored in the AssignmentNumerals register segment of the Document Assignment Control Buffer hasbeen verified the Match Buffer Transport Control FIGS. 1A-1C (26)signals the Template Control FIGS. 1A-1C (17) to delete the content ofComparitor Match Buffer FIGS. 13A-13C (22), activate the Match BufferTransport Control FIGS. 1A-1C (26) and command it to copy the content ofthe Comparitor Match Buffer FIGS. 13A-13C (23) into the Supermine™Document Review File FIGS. 1A-1C (38).

[0121] When the Match Buffer Transport Control FIGS. 1A-1C (26)completes copying the data in the Comparitor Match Buffer FIGS. 13A-13C(23) into the Document Review File FIGS. 1A-1C, (38) it sends a “taskcomplete” signal to the Receiver/Register Sensing Buffer FIGS. 13A-13C(19) that commands Template Control FIGS. 1A-1C (17) to delete thecontent of Comparitor Match Buffer FIGS. 13A-13C (23). Thereceiver/Register Sensing Buffer FIGS. 13A-13C (19) detects the emptyComparitor Match Buffers FIGS. 13A-13C (22-23) and sends a command tothe Match Buffer Transport Control FIGS. 1A-1C (26) to copy the last tenwords of the incoming document into the Receiver/Key Word Comparitorregister FIGS. 13A-13C (21). There the last ten words of the incomingdocument are compared with those in the Match Buffer Transport controlFIGS. 1A-1C (26) to verify that the entire document has been received.If the last ten words of both documents match, the Receiver/RegisterSensing Buffer FIGS. 13A-13C (19) sends a command to the IncomingDocument Transport Control FIGS. 1A-1C (27) to copy the content of theReceiver/Incoming Document Register FIGS. 13A-13C (18) into the documenttext segment of the Document Assignment Control buffer FIGS. 6A-6B (24)It also signals the Template Control Unit FIGS. 1A-1C (17) to delete theten words of the incoming document that were stored in the Receiver/KeyWord Comparitor FIGS. 13A-13C (21) to verify that the entire incomingdocument had been received.

[0122] The document text segment FIGS. 6A-6B (24-b) of the DocumentAssignment Control FIGS. 6A-6B (24) is the same type register as theAssignment Number Register FIGS. 6A-6B, (24-a) but has multiple, serieslinked segments with the capability of activating the next register inthe series when the first register approaches eighty percent fill withan incoming document. This design enables Supermine™ to accept verylarge documents or the complete texts of books. The document textregister segment (24-b) of the Document Assignment Control buffer FIGS.6A-6B (24) sends a “text received” signal to the Document TransportControl FIGS. 1A-1C (27) which then signals the Receiver/RegisterSensing Buffer FIGS. 13A-13C (19) that the document has been copied andto delete the content of the Incoming Document register FIGS. 13A-13C(18). The Receiver/Register Sensing Buffer FIGS. 13A-13C (19) senses theabsence of a document in the Receiver/Incoming Document register FIGS.13A-13C (18) and signals the Busy Transfer Switch FIGS. 1A-1C (12) thatReceiver No. 1 is idle and ready for another incoming document.

[0123] Referring to FIGS. 6A-6B, the Match Buffer Transport ControlFIGS. 1A-1C, (26) has copied the document header from the ComparitorMatch Buffer FIGS. 13A-13C (22) into the Assignment Number Register(24-a) of the Document Assignment Control FIGS. 6A-6B (24). At the sametime, the Incoming Document Transport Control FIGS. 1A-1C (27) alsocopied the text of the document into the Document Text register (24-b)of the Document Assignment Control (24). Referring to FIG. 4, the twoDocument Assignment Control registers FIGS. 6A-6B (24-a, 24-b) areidentical to the sixty-four bit Receiver/Comparitor/Match Buffers FIGS.13A-13C (22,23) and the register of FIG. 4. At this point the headerportion of the Document Assignment Control FIGS. 6A-6B (24-a) registercontains the same numerals as those used in previous FIG. 4explanations.

[0124] Referring to FIGS. 6A-6B (24), the two principal registers in theDocument Assignment Control FIGS. 6A-6B, (24-a, 24-b) are the AssignmentNumber Register (24-a) and the Document Text Register (24-b). They areidentical with the exception of size. In effect, they are “paired” in aseries/parallel configuration that enables the header and the documenttext to be copied, transported or accessed as a complete document or theheader can be accessed singly by accessing the first sixty four slots inthe Assignment Number Register. The purpose of this configuration is toenable Supermine™ to transport the content of the Document AssignmentControl FIGS. 1A-1C, (24) into the storage facility as one document withthe header occupying the first sixty-four bits of the register.Immediately following that operation it is necessary to copy only theheader of the document to an active inventory file and into an offpremise inventory file that is a duplicate of the on premise unit.

[0125] Referring to FIGS. 6A-6B, the following will describespecifically how the Document Assignment Control FIGS. 1A-1C, (24), sameas FIGS. 6A-6B, (24), moves the incoming document into the proper Nationstorage location in a manner to facilitate rapid identification andretrieval. In this same operation, the entire document, including theheader, is copied into an off premise back-up storage unit FIGS. 1A-1C,(39) which is a duplicate of FIGS. 6A-6B and 10A-10B. Referring brieflyto FIG. 4, notice the register positions of the location numerals storedin the Receiver/Comparitor Match Buffer (22). The numbers 057, alongwith the other location numerals, are the first numbers transmitted tothe Warehouse Storage Assignment Buffer FIGS. 6A-6B, (29).

[0126] Referring to FIGS. 6A-6B, the Warehouse Storage structures foreach Nation (folders) described in FIGS. 10A-10B are arrangedalphabetically across the mid-portion of the figure. There can be up to399 individual nation storage locations, each served by multipleWarehouse Storage Assignment Buffers FIGS. 6A-6B, (29) consisting of amultiplicity of sixty-four bit registers (only two of which are shownfor clarity). The lower registers are the Nation Storage Location NumberRegister/Connectors FIGS. 6A-6B (32). These registers have permanentconnections to the input of each Nation Storage unit FIGS. 6A-6B (15).Also FIGS. 10A-10B. For example: The input of storage location numberthree (003) for Australia is permanently connected to the (003) slots inthe lower register. The United Kingdom storage location is permanentlyconnected to the (051) slots and the United States storage location ispermanently connected to the (057) register slots. The Warehouse StorageAssignment Buffer FIGS. 6A-6B, (29) is unique in that each nation numberregister segment contains an electronic switch (similar to the PNPNtransistor switch used in Time Division Switching and many wave shapingfunctions) that is normally non-conducting. This controls the path intothe Nation Storage Locations (15) holding them non-conducting (open) sono data can be entered into the Nation storage Location via thatparticular register.

[0127] Referring to the circular insert on FIGS. 6A-6B, a pair of PNPtransistors is configured to operate as a very fast electronic switch.The negative side of the bias circuit for the switch is connected to thelocation slots of the Nation Location Storage Number Register/ConnectorFIGS. 6A-6B (32) and the positive side of the bias circuit is connectedto the Assignment Match Buffer Register FIGS. 6A-6B (33). The AssignmentMatch Buffer Register (33) is paired with the Nation Storage LocationNumber Register/Connector (32) in a manner to provide a transmissionpath into the assigned Nation storage location for the document beingstored, while holding the input path to all other storage locations open(non-conducting). This assures that the document will be inserted onlyinto the assigned storage location, and no other multiple of theDocument Assignment Control FIGS. 6A-6B (24) can enter that specificNation storage location while transmission is in progress. It should beunderstood that with this parallel type configuration other pairs ofWarehouse Storage Assignment Buffers FIGS. 6A-6B (29) could be activelytransmitting into other Nation Storage locations while this is inprogress.

[0128] The Document Assignment Control FIGS. 6A-6B, (24) commands theDocument Transport Control FIGS. 6A-6B, (28) to transmit the documentinto the Warehouse Storage Assignment Buffer FIGS. 6A-6B, (29). Thecontent of the Document Assignment Control FIGS. 6A-6B (24) begins tofill the Assignment Match Buffer register FIGS. 6A-6B, (33). When thenumerals “057” in the match buffer “match” those in the Nation StorageLocation Number Register/Connector FIGS. 6A-6B (32) there is a positiveenabling pulse applied across the bias circuit for the PNPN switch. Thisraises the “trigger” voltage causing it to conduct, thereby closing alow impedance transmission path from the Assignment Match Buffer FIGS.6A-6B, (33) through the “057” slots in the Nation Storage LocationNumber Register/Connector FIGS. 6A-6B, (32) and into Nation Storagelocation 057. This enables Supermine™ to insert the complete documentonly into Storage Location 057 that is, in reality, a huge computerfolder. The document, along with the Warehouse Storage Assignmentheader, must now be inserted into the Nation Storage Location “057” (15)by discrete fields of industry. The positional configuration of the datathat was inserted in the Receiver/Comparitor Match Buffer FIGS. 13A-13C(22) makes this much simpler.

[0129] For ease of explanation this disclosure assumed that alldocuments originating or pertaining to the United States were related toone of the fifty States. This may not be true in all circumstances orfor all Nations. However, the numbering scheme can be easily extended tomore than the fifty states if needed or location storage foldersestablished for other major segments of society in the US or the world.Referring to the Location Template of FIG. 2, in the storage unit forthe United States (057) there is a shelf (folder) for each state. Theseshelves are listed in alphabetical order with Alabama being first(057001) and Wyoming last (057050). Notice that Texas is the 43^(rd)State in Alphabetical order and carries the assignment (057043).Documents are stored on the warehouse nation shelves in the order of theStates plus any other shelves that carry numbers not associated with aparticular state. The registers associated with each shelf (folder) haveelectrical connections to a multiplicity of input transmission paths tothe Warehouse Storage Assignment Buffers FIGS. 6A-6B (29) that containthe paired Assignment Match Buffers FIGS. 6A-6B (33) and the NationStorage Location Register/Connectors FIGS. 6A-6B, (32). Each shelf(folder) for each state is further sub-divided into segments eachcontaining space for registers to contain documents relating to up to999 fields of endeavor ranging from Accounting through Zoology.

[0130] Referring to FIG. 2, this is the Location Template configurationthat is a “pattern” for the way the registers are configured within theStorage Locations. As stated before, the primary location column 1 ofthe location template represents the 399 freestanding storage locationunits. Column 2, the secondary location column, represents the State or,in the case of other nations, the Provinces. This column corresponds tothe horizontal shelves of the freestanding nation storage unit. Lookingat template 2 is the same as looking down on the top shelf of FIGS.10A-10B. This will give an idea of how the horizontal shelves are laidout.

[0131] Referring to FIGS. 6A-6B, the lower portion of FIGS. 6A-6Brepresents the freestanding shelf structure 057 for the United States.Shown inside the 057 structure are three examples of assignmentregisters that have been “copied” into the 057 shelf structure. Thefirst is for Alabama, the second for Texas and the third for Wyoming.Each state is assigned an entire “shelf” in this warehouse. For thisdisclosure the warehouse storage unit for the United States (057)contains fifty separate “shelves”, FIGS. 6A-6B (16).

[0132] Consider first the register shelf for the State of Texas, i.e.,the center shelf. Note that the register numerals in this example readright to left because they were inserted into the register in a specificsequence from left to right with the 057 numerals the first to betransmitted.

[0133] The location shown in the Assignment Register FIGS. 6A-6B (24-a)has the numerals 057 (US) in the first three slots. The numbers for theState of Texas (043) are in the next three slots. This means that thereis a large storage shelf (folder) for the State of Texas containingsufficient registers to store documents relating to all 999 fields ofindustry. In the example shown, the numerals 002 are in the first slotsof the Industry segment of the register. This is the number assigned tothe agriculture industry. All documents relating to agriculture in Texaswill be stored in this specific position on the Texas storage shelf(043). There are other registers on that same Texas shelf that willcontain a location 001 for Accounting, a location space (006) forstoring documents relating to Oil and Gas, a location 049 for tax dataetc., all stored in the specific industry position in the register thatconforms to industry column 1 of the industry template.

[0134] Referring to FIGS. 2 and 3, the configuration of the scan linesand columns on these templates will compare with the configuration ofthe storage locations within the shelf locations for the various States.In the example used, the first register shown is for the State ofAlabama and carries the numerals 057 plus the numerals 001 in thelocation slots. This identifies the document with Alabama and it will belisted first in the multiplicity of state registers. Now notice that thefirst three numerals in the Industry segment of the Alabama register are036. Now refer briefly to the assignments shown on column 1 of FIGS.5A-5B (30) and see that the numerals 036 are for the TransportationIndustry. This tells us that there will be a storage shelf location(folder) for all documents concerning the transportation industry in theState of Alabama along with as many as 998 other storage locations orfolders for other industries.

[0135] As a further example, the last register shown on FIGS. 6A-6Bcarries the numbers 057050 (right to left). This tells us that thisregister is for the State of Wyoming. The first three Industry slots forthis register carry the numerals 006. Now refer to the Industry locationassignments on FIGS. 5A-5B column 1 (30) to see that the numerals 006identify the Oil and Gas Industry. Wyoming will have an Oil and Gasfolder for all documents concerning Oil or Gas exploration, drilling,production etc. in the State of Wyoming. There will be a similar folderfor Oil and Gas in the folders for the States of Texas, California,Arkansas, Oklahoma and other states having oil and gas industries. Inour example of planting string beans in Pasadena, Tex. we previouslyexplained the assignment numbering system below the primary industrynumerals (002). The numbering system is the same for all industries. Theindustry identification of documents can be taken down to very finegrain detail by simply continuing the template numbering schemes. Itwill be possible to call up documents that describe the type drill bitneeded to drill for oil in Odessa, Tex.

[0136] In summary, Supermine™ provides a storage warehouse for allnations plus other major divisions as desired. Within that hugewarehouse, there are storage shelves (folders) for all states orprovinces or major divisions within that nation. Within the folder forall states there are individual folders for the 999 major divisions ofindustry or fields of endeavor. Each industry folder is furthersubdivided into the specifics and details of that industry or field oflearning or study.

[0137] When the complete document has been received, the DocumentAssignment Control FIGS. 6A-6B (24) commands the Document TransportControl FIGS. 6A-6B (28) to copy the document into the Warehouse StorageAssignment Buffer FIGS. 6A-6B, (29). From here the header and documenttext pass through the protective “gating circuitry” FIGS. 6A-6B, (33)and are inserted into the proper Nation Location Warehouse StorageLocation and the Industry Storage Location within that Nation Locationvia the Nation Location Storage Register/Connector FIGS. 6A-6B (32).When the Warehouse Storage Assignment Buffer FIGS. 6A-6B, (29) becomesempty, a “document received” signal is returned to the DocumentAssignment Control, FIGS. 6A-6B, (24). This prompts the DocumentAssignment Control, FIGS. 6A-6B, (24) to command the Document TransportControl FIGS. 6A-6B, (28) to create a “select all” command to copy thecontent of the Assignment Number Register FIGS. 6A-6B (24-a) (thedocument header) into the on premise Supermine™ inventory unit, FIG. 14.This is also FIGS. 1A-1C, (37).

[0138] The Supermine Inventory Unit FIG. 14 (37) returns a “documentreceived” signal to the Document Assignment Control, FIGS. 6A-6B, (24).This prompts the Document Assignment Control, FIGS. 6A-6B, (24) tocommand the Document Transport Control FIGS. 6A-6B, (28) to create a“select all” command to copy the content both the Assignment NumberRegister FIGS. 6A-6B (24-a) (the document header) and the Document Textregister, FIGS. 6A-6B (24-b) into the off premise Supermine™ StorageWarehouse FIGS. 1A-1C, (39). The off premise warehouse is a duplicate ofthe warehouse of FIGS. 10A-10B and is populated exactly the same asdescribed above. The remote Assignment Match Buffer (off premiseduplicate of FIGS. 6A-6B, (33)) returns a “document received” signal tothe Document Assignment Control, FIGS. 6A-6B, (24) which sends the“document received” signal to the on premise FIGS. 6A-6B, DocumentAssignment Control (24). This prompts the Document Assignment ControlFIGS. 6A-6B, (24) to signal the Document Transport Control FIGS. 6A-6B(28) to enter the Assignment Number Register (24-a) and to copy the64-bit header into the Supermine™ off premise Inventory Unit, FIGS.1A-1C, (37). The off premises inventory unit returns a “documentreceived” signal to the Document Assignment control FIGS. 6A-6B, (24),same as FIGS. 1A-1C (24). This triggers the Document Assignment Control,FIGS. 6A-6B, (24) to delete the content of the Assignment NumberRegister FIGS. 6A-6B (24-a) and the Document Text Register FIGS. 6A-6B(24-b) making them ready for another document.

[0139] These operations insure that every document stored the Supermine™Warehouse also has the header for that document stored in the Supermine™Inventory Unit, FIGS. 1A-1C, (39). In addition, every document stored inSupermine™ Warehouse has a copy of the document stored in the remote,off premise Supermine™ Warehouse FIGS. 1A-1C (34) and the off premiseInventory Unit FIGS. 1A-1C (39). The method of storing documents in, andretrieving documents from, these off premises back-up facilities isduplicated with those for the primary Supermine™ installation. The useof multiple components such as the Data Receiver/Key Word ComparitorFIGS. 13A-13C (11), Template Control Units FIGS. 1A-1C (17), DocumentAssignment Control Units FIGS. 6A-6B (24), and liberal provisioning ofregisters and buffers at all stages of the process enable Supermine™ toprocess incoming documents in parallel operation rather than serially.This results in a continuous document acquisition operation unhinderedby overloaded or busy components.

[0140] At this point of the disclosure we have described how Supermine™can insert a pre-prepared template into a polling unit programmed topoll the internet or other data sources, find a document relative to aspecific subject, receive the incoming document in any one of severallanguages, note the date received, assign a unique Supermine™ documentnumber, electronically “read” the document and convert key wordscontained in the document into fifty-nine numerals and symbols and usethem to store the document in a storage location for a specific nation.Further, we have explained how the documents can be indexed to identifythe industry and the specifics of the industry that are mentioned in thedocument. We have explained how to copy the document, along with theidentifying header, into a specific location in a specific State withinthe storage location. This is done without a human ever seeing orhandling the document.

The Document Retrieval Process

[0141] Referring to FIGS. 1A-1 to 1A-3, the Supermine™ documentretrieval process begins with the client who has the option ofpurchasing three levels of access. Referring to FIGS. 1A-1C, 7, and5A-5B, access level one enables clients to enter Supermine™ at theanalyst level and obtain assistance from the analyst or a staff memberin taking data from Supermine™. Access level two enables client accessto the analyst plus direct access to the programmable polling unit. Thislevel of access grants clients access to the Supermine™ search templatesand allows them to copy templates directly into the Search TemplateRegisters, FIGS. 1A-1C, (4). These templates “trigger” the programmablepolling unit FIGS. 1A-1C, (1) to search the Internet and other sourcesfor documents that “match” the key words of the search templates. Theresulting documents are processed into Supermine™ storage as previouslydescribed. The method of delivering those documents to the client,analyst or staff member who requested the document is described in theretrieval process. Access level three is virtually unlimited access toSupermine™ including direct access to analyst assistance, theprogrammable polling units and the capability of retrieving documentsdirectly from Supermine™ warehouse as described in this disclosure.

[0142] This disclosure teaches an efficient means of electronicallycommunicating with Supermine™ to specify the data needed. The timerequired to locate, access and retrieve documents is greatly reduced byusing questionnaires that lead the client, Supermine™ staff or analystto reduce the data request to concise key words, in a specific order,that describe the essentials of the data needed. This eliminates themajority of human intervention when clients desire access or storeddata. We have previously explained that the information contained in theSixty-Four Bit Comparitor Match Buffer FIGS. 6A-6B (22) (the documentheader) has been made an integral part of the document that wasprocessed through the Supermine™ system. The document was stored inspecific Supermine™ locations using the location and industry numbersthat were developed in the storage process. This enables Supermine™ toretrieve the document by “finding” any combination of those assignmentfeatures.

[0143] Referring to FIG. 4 and to the six segments of the Sixty-Four-BitComparitor Match Buffer FIGS. 13A-13C (22), the Supermine™ system isdesigned to provide several ways to select the document for retrieval.These are:

[0144] 1. By the Nation, State, City, Community or specific locationnumbers (i.e., the first fourteen slots in the register).

[0145] 2. By the Industry, and various functions of that industry (i.e.,the next fourteen slots in the register).

[0146] 3. By the Supermine™ document number (i.e., the next eleven slotsin the register).

[0147] 4. By the number of the industry Template that was used in theReceiver Key Word Comparitor Register (21) to “read” and select theincoming document for storage (i.e., the next eleven slots in theregister).

[0148] 5. By the date the document was received if used in combinationwith other identifying data. If only date received slots are used forretrieval, it could result in an output of all documents that wereacquired and stored on the date specified. The five vacant registerslots shown as spare on the FIG. 4 drawing are used in some applicationsfor client or Supermine™ employee identification numbers or to determinethe language used in the incoming document.

[0149] These fifty-nine numerals and symbols being inserted intopredetermined positions in a specific order enable Supermine™ to usemathematical probabilities to sharply reduce document retrieval time andto exponentially increase the success rate of document retrieval.Clients, Industry Analyst and Supermine™ staff members access Supermine™at all hours, seven days/week/fifty-two weeks/year from all over theearth using a wide variety of computer protocols, program platforms andlanguages. In high speed, high volume data transactions it is essentialthat client retrieval time per document be held as low as possible. Inaddition to using mathematical probabilities, Supermine™ also utilizesunique circuitry and hardware configurations to reduce documentretrieval time by reducing scanning time in look-up tables etc.

[0150] Block Diagram FIGS. 1A-1 to 1A-3 describes how documents storedin Supermine™ Warehouse are identified, located and retrieved from theWarehouse Storage Units and made available to clients, Supermine™ staffand Analysts. A primary objective of Supermine™ is to automate, insofaras possible, the document retrieval process. This begins with providingclients and staff members with a questionnaire that will enable them toinput their requests for data directly into Supermine™. For a betterunderstanding of this procedure a description of the design and use ofthe questionnaire is in order at this point in the disclosure.

[0151] Referring to FIG. 7, the questionnaire is designed to obtainwords from the clients that are commonly used in the industry or fieldof knowledge or the area of commerce he is interested in. The analystwho prepared the templates and the questionnaires are experts in thoseindustries and are intimately acquainted with their terminology. Theyprepared the search templates and questionnaires using key wordsspecific to those industries. In the retrieval process these key wordsare converted into numerals that are inserted into the variousregisters. In addition, the search templates and the questionnaires aredesigned to elicit answers in the same order that the resulting numeralsare inserted into those registers. FIG. 7 is a prototype of a datasearch questionnaire that clients, staff and analyst use to search fordocuments containing information they need. It is designed to reducelanguage barriers and verbal communication between Supermine™ analystsand clients retrieving documents from Supermine™.

[0152] When answered, the questionnaire will provide sufficientinformation to enable Supermine™ to retrieve any document stored inSupermine™. The information requested by FIG. 7 is self-explanatory withthe exception that the order of the questions is important in that thisanswer sequence is used to prepare a template with the words in specificregister slots. These words are then converted into numerical retrievaltemplates. It should be understood that the templates and questionnaireare not limited to typing the answers into the Retrieval Search TemplateFIGS. 8A-8B (34-a). The document retrieval process can also be voiceactivated, i.e. “Please say the name of the industry you are interestedin now,” or controlled by a mechanized answering process, i.e. “If youwant a document that originated in the US press one”, in the State ofOhio press two etc. When voice recognition has progressed sufficientlyto overcome national and regional dialects and widely differing speechpatterns, the Supermine™ system will make use of that technology.

[0153] Referring to FIG. 7A, every client data request carries aclient's account number. While not shown on the FIG. 7A example, everySupermine™ analyst and staff member is also assigned an individualidentification number with the same number of digits as the client'saccount number but having a different number series. These digits areplaced in the same FIG. 7A position as the client's account number. Theclient's answers to the FIG. 7 questionnaire (FIG. 7A) are entered intothe Retrieval Search template Polling and Match segment FIGS. 1A-1 to1A-3, (34-a), (also FIGS. 8A-8B) where they are converted into numeralsusable as a template to search the warehouse storage unit, FIGS. 10A-10Bfor the desired document.

[0154] Retrieving documents from Supermine™ warehouse begin with theLocation and Industry templates. Referring to FIGS. 2 and 3, notice thatthe template column numbers correspond to the numeral position slots inthe Comparitor Match Buffer FIGS. 13A-13C (22). The column numbers alsoconform to the register slots of FIG. 4. In addition, a specific scanline identifies each major aspect of the template. This technique isused to get the numerals in the correct register slots in the event anincoming document contains a reference to duplicate, or to only one,aspect of the location or the industry. The importance of programmingSupermine™ using the columns of the template and the scan line positionsto prevent inserting location or industry numerals out of sequence inthe registers will become apparent as the retrieval disclosureprogresses.

[0155] Referring to Block Diagram FIGS. 1A-1C, the client's request fordata enters Supermine™ through the protocol converters (6) into thePassword gate of the Security Screening unit (7) that is programmed torequire the client's password. When the password is properly entered,the security program passes the data request to the Account Numbersegment of the Security Unit (7). The Security program requests theclient's Supermine™ account number. Encoded in this number are digitscorresponding to the level of access the client has purchased. Refer tothe examples of the client's account numbers shown on FIGS. 1A-1C (7)and notice the arrangement of the account digits. For simplicity, thehighest, most significant digit is for level three (00000332). When thenumber three is in the most significant digit position in the accountnumber the client has purchased access level three. With thisconfiguration there could be many levels of access to differentSupermine™ services depending upon the market demand. There are numerousencoding schemes that could be used in conjunction with the accountnumber to denote the access level purchased by the client. The clientdata request is passed to the Client Access Level Port (5) thatdistributes the incoming service requests to the proper Supermine™ inputdevice depending upon the access level purchased.

[0156] The access paths for the three levels of entry into Supermine™will now be described. Supermine™ staff and industry analyst areassigned level three and given discrete identification numbers in lieuof an account number. With that exception, data requests from Supermine™employees are processed the same as data requests from level threeclients. If question three of the Supermine™ data request is answered“Level 1” by the client, the data request is passed directly to aSupermine™ industry analyst who uses the address provided by the clientto contact and assist the client either directly by voice or by theinternet address.

[0157] If the client has answered “Level 2” for question three,requesting direct polling capability, the request is routed into theSearch Template segment, FIGS. 1A-1C, (4) of the Programmable Pollingunit FIGS. 1A-1C (1). Copying the key words provided by the clientdirectly into the polling unit results in a direct poll of the Internetor other sources to search for the data requested. The returneddocuments, if any, are processed through the Supermine™ system asdescribed in this explanation.

[0158] Referring to block diagram 1A, (34), if the client answers “Level3” on question three of the Supermine™ data request questionnaire FIG.7A the Client Access Level Unit FIGS. 1A-1C (5) directs the request intotwo parallel paths: The first path is to the Supermine™ DocumentDistribution Control, FIGS. 1A-1 to 1A-3, (41) or FIG. 11 that contains(among other units) the Supermine™ Data Request File FIGS. 1A-1 to 1A-3,(40) (also FIG. 11). This file is composed of a multiplicity of pairedregisters. One register will accept and contain the client's datarequest in its original form. The other member of the pair will containthe client's FIG. 7A data request after converting the words intonumerals has modified it so it can be compared with the pre-preparedtemplates, FIGS. 2 and 3.

[0159] Each Data Request File, FIGS. 1A-1 to 1A-3, (40) containscircuitry and programming to insert incoming data requests into anumbered file arranged to offer stored data requests for comparison in a“first-in-first out” arrangement. In addition, circuitry and programmingis provided to prevent selecting and sending the contents of a datarequest register to the Data Request/Received Document Comparitor FIGS.1A-1 to 1A-3, (41-b) or FIG. 11 a second time during the comparisonprocess. Once the register has been selected and “no match found”, thatdata request is “marked” and made busy to further comparison until allother stored data requests have been compared. The comparitor functionof the Data Request File is described later in the disclosure.

[0160] The second path for the client's Data Request is to the DocumentRetrieval Control FIGS. 8A-8B (34), the same as FIGS. 8A-8B. That unittransmits the client's data request into the Retrieval Search Templatesection, FIGS. 8A-8B (34-a) which is very similar to the Search TemplateRegister (4) of FIGS. 1A-1C. However, this search template is programmedto search the Supermine™ warehouse instead of the Internet. TheRetrieval Document control Template FIGS. 8A-8B (34) is a polling andscanning unit that returns a prompt to the client, analyst or staffmember leading them to type the data request FIG. 7A into the RetrievalTemplate Format section (34-b).

[0161] For purposes of the document retrieval disclosure, assume theclient or Supermine™ staff person typed the data request answers of FIG.7A into the Retrieval Search Template (34-b) of FIGS. 8A-8B. The keywords from the client's data request FIG. 7A are copied by the RetrievalSearch Polling and Match segment, FIGS. 8A-8B (34-a) into TemplateFormat section FIG. 8A-8B (34-b) that receives the key words from theclient questionnaire FIG. 7A and places them in the proper registerslots to conform to the Supermine™ template format. The order of thequestionnaire FIG. 7A replies conforms to the order of placement of theKey Words in the template Format Section FIGS. 8A-8B (34-b). TheTemplate Format section (34-b) is actually a large register formatted insix parts to correspond to the positions of the numerals of theComparitor Match Buffer of FIGS. 13A-13C and the Assignment Match bufferof FIGS. 6A-6B (33).

[0162] The first thirteen slots plus one space of the Template FormatRegister FIGS. 8A-8 (34-b) are for the Nation/State/City/Communitylocation numerals. The second thirteen slots plus one space is for theindustry numerals. The next ten slots plus one space is for theSupermine™ document number. The next ten slots plus one space is for theIndustry template number that was used to retrieve the document. Sincethe date received is already recorded in the header of the requesteddocument, the eight date-received slots in the register are not neededat this point of the retrieval process. For that reason they are usedfor the account number of the client or the identity number of theindustry analyst or the staff employee. In this operation the sixth partof the register, the normally unused slots, is for recording thelanguage used in the document.

[0163] As the words are received, they are placed in the proper orderand format by the Template Format section FIG. 8A-8B (34-b) of theDocument Retrieval Control Unit FIGS. 1A-1 to 1A-3 (34) or FIGS. 8A-8B(34). Any words that refer to the Nation, State, or Province, City orspecific location are “read” into the location section of the register.Industry words are inserted into the industry segment of the register inthe proper order. Since there is no word-to-numeral translation neededfor the Supermine™ document number and client account or staff identitynumber they are inserted directly into the Retrieval Template RegisterFIGS. 8A-8B (34-c) when received by the Retrieval Search Polling andMatch segment FIGS. 8A-8B (34-a). The person requesting the documentwill input the industry template number directly into the RetrievalTemplate if that number is known at the time of the request. There will,however be times when the industry template number is not known inadvance and must be derived by using the FIG. 7A questionnaire.

[0164] Referring to FIGS. 8A-8B, when the words from the answers to thedata request form FIG. 7A have been inserted into the Template FormatSection FIGS. 8A-8B (34-b) of the Document Retrieval Control Unit FIGS.8A-8B (34) a “word complete” signal is sent from the Template FormatSegment of (34-b) to the Search Template Segment (34-a) of FIGS. 8A-8B,(34.). This prompts it to poll the Location and Industry Key WordTemplate Storage Unit FIGS. 5A-5B. (8) searching for location andindustry templates with words matching those in the Template Formatsegments of FIGS. 8A-8B (34). In this search process, the SearchTemplate FIGS. 8A-8B (34-a) sends each word into the Location andIndustry Key Word Template Unit FIGS. 5A-5B and receives the numeralthat corresponds to that word. The numeral is placed in the correctregister slots in the Retrieval Template Register FIGS. 8A-8B, (34-c).For example, the Search Template FIGS. 8A-8B (34-a) transmits the word“US” into the location portion of the Location and Industry Key WordTemplate Storage Unit FIGS. 8A-8B and receives in return the numerals057. These numerals are stored in the Nation position of the retrievalTemplate Register FIG. 8A-8B (34-c). When the Nation slots are filled a“nation slots filled” signal is sent from the Retrieval TemplateRegister, FIGS. 8A-8B, (34-c) to the Search Template FIGS. 8A-8B (34-a)then transmits the word “Texas” into the Location and Industry Key WordTemplate Storage Unit FIGS. 5A-5B. The scanner in that unit works downthrough the alphabetically listed state templates and on the scan linefor Texas, finds a match and inserts the numerals 043 in the stateposition of the Retrieval Template Register FIGS. 8A-8B (34-c). The sameaction takes place for the words Houston, and Pasadena. When complete,the Retrieval Register FIGS. 8A-8B (34-c) will contain the numerals05704300401 in the location register slots.

[0165] When all location slots are filled in the Retrieval TemplateRegister FIGS. 8A-8B, (34-c) a “location template complete” signal issent to the Search Template FIGS. 8A-8B (34-a) that opens the industrysection of the Retrieval Template Register FIGS. 9A-9B (34-c) and sendsthe numerals 057 into the Location and Industry Key Word TemplateStorage unit FIGS. 5A-5B (8). It will poll the location numbersregisters looking for a matching numeral 057 that was inserted by theanalyst into the location numbers portion of the Location and IndustryKey Word Template Storage, FIGS. 5A-5B. The analyst had previouslyinserted the 057 at the language column that matched the language usedto construct the template. Referring to FIGS. 5A-5B, the Location andIndustry Key Word Template Storage Unit, FIGS. 5A-5B is a huge look-uptable that contains three interconnected segments. It is programmed torespond with corresponding numerals each time a search is made usingwords. It will also respond with corresponding words each time a poll ismade using numerals.

[0166] There are two methods provided for finding the correct languagefor the document search. If the client types the word “English” into theRetrieval Search Template FIGS. 8A-8B (34-a) indicating the language hewants is English, the word “English” is stored in the language sectionof the Template Format section of FIGS. 8A-8B, (34-b) i.e. the lastthree slots. In this case, when the location numerals have beenretrieved from the Location and Industry Key Word Template Storage UnitFIGS. 5A-5B, the Search Template FIGS. 8A-8B, (34-a) sends the word“English” into the template storage unit (FIGS. 5A-5B) and receives thenumerals 01 in reply which is column 01 of all templates. This column isreserved on all templates for the English language. These numerals aretemporarily “set” in the language slots in the Template Format sectionof FIGS. 8A-8B, (34-b) and are transmitted with each word that is sentinto the Location and Industry Key Word Template Storage Unit FIGS.5A-5B. This forces all industry word searches to be made only in column1 of the storage unit that is reserved for the English language. If thematch had been found in column 2, the numerals 02 would have beeninserted in the language section of the Template Format section FIGS.8A-8B (34-b) and all searches would be made in the Spanish language etc.

[0167] The second method for determining the language is to be used whenthe client does not indicate what language is used in the data he issearching for. It can also serve as a verification of the other languagesearch method if there is any doubt about the language. The SearchTemplate, FIG. 8, (34-a), is programmed to transmit the nation numerals057 into the location numerals register of the Location and Industry KeyWord Template Storage Unit (FIGS. 5A-5B) as soon as all locationregister slots are filled in the Retrieval Template Register FIG. 8,(34-c). As mentioned previously, when the industry analyst completed the“US” templates in the English language, he copied the numerals 057 intothe nation numerals segment of the Location and Industry Key WordTemplate Storage Unit FIGS. 5A-5B, at column 01 which is reserved forthe English language.

[0168] When the Search template FIG. 8, (34-a) transmits the numerals057 into the Location Numbers Register of the Location and Industry KeyWord Template Storage Unit FIG. 5, the numbers 057 find a match inlanguage column 01. This stops the polling of the language columns atcolumn 01 and a “match signal” is sent to the Search Template FIG. 8.(34-a) which “sets” the numerals 01 into the Retrieval Template RegisterFIG. 8, (34-c) at the language segment slots that were never closed. Itcannot enter the 01 column number in the industry position because atthat time the industry slots are still closed. When the “locationtemplate complete” signal was sent from the Retrieval Template RegisterFIG. 8, (34-c) to the Search Template (34-a), that signal was also usedby the Search Template to open the industry register slots in theRetrieval Template Register FIG. 8, (34-c). When the signal was receivedthat the industry slots were opened, the Retrieval Search Template FIG.8, (34-a) began polling column 01, the English language column, of thetemplate storage unit FIGS. 5A-5B (8) looking for a word match betweenthe words provided by the client that were stored in the industrysegment of the Template Format Section (34-b) and the industry templatesstored in the Location and Industry Key Word Template Storage Unit FIGS.5A-5B (8).

[0169] The Retrieval Search Template FIGS. 8A-8B (34-a) transmitted theword “agriculture” into column 01 of the Location and Industry Key WordTemplate Storage Unit FIGS. 5A-5B (8). On the first scan, since 001represents accounting templates, no match was found. On scan 2, a matchwas found for the word “agriculture” and the numerals 002 were returnedand inserted into the industry slots of the Retrieval Template RegisterFIG. 8, (34-c). The same action is repeated for the words crops, stringbeans, soil and planting, with the resulting numerals stored in theRetrieval Template Register FIG. 8, (34-c) in the same order as those onthe Supermine™ standard register column assignment scheme of FIG. 4.Referring to FIGS. 5A-5B, for drawing clarity, only the IndustryTemplate primary column numbers are shown, i.e. 002. Refer to FIG. 3,the Industry Template, for an understanding that all industry columns,1, 1A, 2, 3, 4, 5 are included in FIGS. 5A-5B, English column 01 andsubsequent language columns 02 through “N”. If the language used on thereturned Supermine™ data search questionnaire FIG. 7A had specified theSpanish language, no match would have been found in the search of column01 and the scan would have proceeded to column 02 and all words returnedin the Spanish language. When the industry slots are filled, an“industry Template complete” signal is sent from the Retrieval TemplateRegister, FIGS. 8A-8B, (34-c) to the Retrieval Search Template, FIGS.8A-8B, (34-a). That signal is used to open the Client Account numberslots in the Retrieval Template register FIGS. 8A-8B, (34-c).

[0170] When the Retrieval Search Template FIGS. 8A-8B, (34-a) senses theopen Retrieval Template Register slots in the account number positions,it signals the Template Format Section FIGS. 8A-8B (34-b) to move theclient's account number into the account number slots of the RetrievalTemplate Register FIGS. 8A-8C, (34-c). Note that Supermine™ does notknow the Supermine™ document number at this time, and the document weare searching for already has the Supermine™ document number embedded inits header and therefore it is not needed at this point. Also note thatthe vacant Supermine™ document number and Industry template number areused when the client or staff inputs these numbers directly into theRetrieval Search Template FIGS. 8A-8B, (34-a). The numbers are movedinto the Retrieval Template Register FIGS. 8A-8B, (34-c) as previouslydescribed. This provides a very efficient document retrieval process forSupermine™ Staff, Analyst and selected clients.

[0171] A timing sequence is built into the Retrieval Template RegisterFIGS. 8A-8B, (34-c) in the event the client does not answer all FIG. 7Aquestions. When no additional numerals are received into the RetrievalTemplate Register FIGS. 8A-8B, (34-c) after five milliseconds, a startsignal is sent to the Retrieval Document Transport, FIGS. 8A-8B, (35).This start signal causes the Retrieval Document Template Transport FIGS.8A-8B (35) to copy the content of the Retrieval Template Register FIGS.8A-8B (34-c) into one of a multiplicity of Scan Control/ComparitorsFIGS. 9A-9B (36).

[0172] The data request must be associated with the retrieved documentto deliver it to the proper requesting party. This is made possible bycreating a parallel path into the inputs of the Scan Control ComparitorsFIGS. 9A-9B (36-a) and the Data Request file of FIG. 11 (40). That fileis composed of paired registers. The first register already contains theclient's original data request that was inserted at the same time therequest went into the Document Retrieval Control, FIGS. 8A-8B (34). Thesecond register contains the client's data request as modified byconverting words into numerals as just explained. This puts the clientrequest in the same format as the search made in the document warehouseto assure associating the retrieved document with the correct datarequest. When the document is retrieved from the Supermine™ warehouseFIGS. 10A-10B and 6A-6B it is returned to the Document DistributionControl, FIG. 11 (41) where it is matched with the Data Request RegisterFIG. 11 (40-b) that carries the modified document request.

[0173] The Scan Control/Comparitors FIGS. 9A-9B, (36) are designed tospeed document retrieval by reducing the scan and match time required toselect a document from the Nation storage location FIG. 10 (15) andFIGS. 6A-6B and deliver it to the Supermine™ Document DistributionCenter FIGS. 12A-12B (42). Referring to FIGS. 9A-9B (36), the RetrievalDocument Transport FIGS. 8A-8B (35) copies the content of the RetrievalTemplate Register FIGS. 8A-8B (34-c) into the input of ComparitorRegister FIGS. 9A-9B (36-a). This unit is a component of the ScanControl/Comparitor FIGS. 9A-9B (36). The numerals shown in that registerare those derived from the information provided by answers provided bythe staff, client or analyst when the data search questionnaire answers,FIG. 7A were inserted and converted into numerals.

[0174] As the Comparitor Register of FIGS. 9A-9B (36-a) begins to fill,the Scan Control segment of FIGS. 9A-9B (36) senses the insertion ofdigits corresponding to column 01 of the location templates (see FIG.3). As the numerals 057 fill the location column 01 position, the activenumerals in the three slots are detected by the location segment of theColumn Scan Control in the Scan Control/Comparitor FIGS. 9A-9B (36) andare copied into the Assignment Number Register FIGS. 6A-6B 24), theninserted into the Assignment Match Buffer FIG. 6, (33). This starts theprocess of selecting the proper Warehouse Nation Storage Location FIG.6, (15). When the numerals 057 find a match in the three slots of theNation Location Number Register/Connector FIG. 6, (32) the connection iscompleted (as previously described on FIGS. 6A-6B) from the 057 StorageUnit (for the US) to the Column Scan Control FIGS. 9A-9B (36). Thelocation segment of the Scan Control/Comparitor FIGS. 9A-9B (36) copiesthe numerals 057 into the Document Match Register FIGS. 9A-9B (36-b).The process of elimination has begun. Only documents originating orrelating to the US will now be considered because documents on othernation storage shelves have been eliminated. However, the document weseek is still only one of hundreds of thousands that could be in any oneof the fifty States in the 057 storage structure.

[0175] Continuing to refer to FIGS. 9A-9B as the numerals 043 for theState of Texas fill column 02 of the Scan Control Comparitor RegisterFIGS. 9A-9B (36-a) the change in state of the 043 column slots isdetected by the location segment of the Scan Control FIGS. 9A-9B (36)and the numerals 043 are copied into the Assignment Match Buffer FIG. 6,(33) that is connected to the 057 warehouse storage shelf. The scannerthen “steps” over to the column 2 registers, begins scanning at the scanline for 057 and scans the registers on shelf (folder) 057 in the column2 position looking for a matching number 043. When a match is detected,the location scan control segment of Scan Control/Comparitor FIGS. 9A-9B(36) copies the numerals 043 into the Document Match Register FIGS.9A-9B (36-b) at scan line 02. We have now eliminated all documents instates other than Texas. However the document we seek is still one oftens of thousands.

[0176] The Supermine™ system uses mathematical probabilities to speeddocument retrieval. Supermine™ is a high volume, high-speed processmaking it necessary to hold document retrieval time as low as possible.This need is met by the invention of the “Match Count Copy Trigger” thatenables Supermine™ to use mathematical probabilities to speed documentretrieval and to control the scope of the document search. Thisinnovation is shown schematically on FIGS. 9A-9B, (36-c). It is madepractical because of two important design characteristics of Supermine™:

[0177] 1. The conversion of all words and letters into numerals andplacing them into specific slots in all registers.

[0178] 2. Designing the headers and registers to correspond to the scancolumns and scan lines of the Location and Industry templates. In thisway, specific locations and specific aspects of industries are always intheir own unique slot positions in all registers and headers.

[0179] For an example of this feature refer briefly to FIG. 4. Noticethat (left to right) there are thirteen numbers in the first thirteenregister slots plus one spacing slot. These fourteen numbers representthe specific location of Pasadena, Tex. as previously explained. Thenext thirteen slots plus one spacing slot represent the requirements forgetting a good string bean crop in Pasadena, Tex. The next ten registerslots plus one spacing slot represent the Supermine™ document numberwith the next ten slots representing the industry template number usedto identify and begin storing the document. Any of these segments can beused to retrieve a document.

[0180] Referring again to FIGS. 9A-9B, notice that the RetrievalTemplate Transport FIGS. 8A-8B, (34-c) has “copied” the header numbersinto the registers of FIGS. 9A-9B, (36-a) in the exact order that theyappear in the search template header and on the document being sought.This is the template that tells Supermine™ where to look for the neededdocument. The Scan Control Comparitor FIGS. 9A-9B, (36) will use thenumbers stored in FIGS. 9A-9B (36-a) to search through the Supermine™warehouse looking for a set of numbers that match those of the searchtemplate stored in FIGS. 9A-9B, (36-a). When a matching number is found,it is copied into its proper slot in the Document Match Register FIGS.9A-9B, (36-b). Note that, for drawing clarity, only the location andindustry segments of the header are shown in FIGS. 9A-9B (36-a) and(36-b). In actual operation there will also be a segment for theSupermine™ document number and the Industry template number, and ifdesired, a segment for the date received. These three added segments toFIGS. 9A-9B, (36-a) and (36-b) will permit retrieving documents usingonly the Supermine™ document number or the industry template number. The“Match Count Copy Trigger” (FIGS. 9A-9B, (36-c) will have sensorsconnected to each scan column of each of these segments.

[0181] Referring to FIGS. 9A-9B, (36-c), the “Match Count Copy Trigger”has sensors connected to all twenty-four numeral slots in the DocumentMatch Register FIGS. 9A-9B, (36-b). As the scan progresses, everymatching number found on the stored document is copied into DocumentMatch Register, FIGS. 9A-9B, (36-b). As each register slot is filled,the probability that the document being scanned is the desired documentincreases exponentially with every number added. A mathematician wouldlook at the twenty four numbers in the Comparitor Register, FIGS. 9A-9B,(36-a), each to the base 10, and compare them with those same numbers inthe Document Match Register, FIGS. 9A-9B, (36-b). At each point in thescan and match process the probability of the two documents having thesame header can be calculated.

[0182] Refer to FIGS. 9A-9B, (36-c) and notice the control lead comingfrom the control analyst into the match count trigger FIGS. 9A-9B(36-c). This control allows the analyst to set the “Match Count CopyTrigger” point at the desired percentage of the probability that the twodocuments are the same. Reaching that probability percentage settingwill “trigger” the Match Count Trigger, FIGS. 9A-9B, (36-c) to commandthe Column Scan Control to stop the comparison scan and copy all 24numbers of the stored document header into the Document Match RegisterFIGS. 9A-9B, (36-b). This configuration allows the analyst to adjust thecontrol program in the “Match Count Copy Trigger” to cause it to triggerwhen one, two or three segments have matching numbers, or on the overallprobability of all segments that the two documents are the same. Thisnot only reduces scan and match time of the retrieval process by anestimated thirty percent, but enables the analyst to control the scopeof the search and the amount of detail collected with no degradation ofdocument selection accuracy.

[0183] The above process continues with the numerals 004 for the city ofHouston, Tex. being inserted and a scan made of the column 3 positionsof the registers. A match is found, and the numerals 004 inserted intothe Document Match Register FIGS. 9A-9B (36-b). The same operation iscompleted for the community of Pasadena, Tex.=01. It should beunderstood that this technique can be used to identify a specific officeon a specific floor of the Pasadena, Tex. State Bank or a specific houseon a specific street in Pasadena, Tex. by simply adding digit slots tothe location registers.

[0184] At this point we have increased the probability of locating thespecific document we need by eliminating all locations other thanPasadena, Tex. If Pasadena, Tex. were a large farming location, ourdocument could still be one of hundreds or even thousands. As statedearlier, not all clients will provide a perfect reply to the Data SearchQuestionnaires FIG. 7. There will be searches made that will have only afew slots filled in the registers. For that reason there will be timeswhen all available information contained in the document must be used.To speed document selection by reducing scan time and to furtherincrease the probability that the document in the 057 warehouse storageunit being scanned is the one we seek, the order of filling the industryslots in the Comparitor Register FIGS. 9A-9B (36-a) is reversed. Theindustry specific slots 06,05,04, etc. are filled first. This causes theScan Control/Comparitor FIGS. 9A-9B (36) to scan these slots first, i.e.in reverse order. Instead of scanning through the broad industrycategories and working down to the industry specifics, the Scan ControlComparitor FIGS. 9A-9B (36) now “looks” first at the specifics of thedocument. This greatly reduces scan time and increases the probabilityof selecting the desired document much earlier.

[0185] When all corresponding slots match, both filled and empty, theColumn Scan Control segment of the Scan Control/Comparitor originates a“select all” command and enters the register selected in the 057 storagelocation and copies the entire document, both header and text, into theholding registers of Retrieval Document Transport of FIGS. 8A-8B (35).If no additional text data is received after a five millisecond delay,the Retrieval Document Transport FIGS. 8A-8B (35) originates a “selectall” command and copies the header and text into the Retrieved DocumentRegisters, FIG. 11, (41-a) of the Document Distribution Control Unit,FIG. 11 (41).

[0186] Referring to FIG. 11, the Retrieved Document Registers FIG. 11,(41-a) are in a series, parallel configuration that enables the headerand text to be selected and copied together or in separate segments.This permits selecting the first 64 bits that comprise the header of thedocument and comparing those bits with the modified clients request thatis stored in the Data Request File FIG. 11, (40) in the client'smodified request register. After a five-millisecond delay to assure thatthe entire document was received, the Retrieved Document Registers, FIG.11, (41-a) sends a “document received” signal to the DocumentDistribution Control Unit, FIG. 11, (41). This unit is programmed toreturn a command to the Retrieval Document Transport FIGS. 8A-8B, (35),to delete the copy of the document in the Retrieval Document TransportFIGS. 8A-8B, (35). This was the document that was transported (copied)to the Document Distribution Control FIG. 11, (41). It then activates aprogram that will copy the 64-bits of the Retrieved Document registerFIG. 11, (41-a) (the document header) into the Retrieved Document HeaderRegister segment of the Document Request/Retrieved Comparitor FIG. 11,(41-b). This begins the process of comparing the retrieved documentheader with the client's request that was modified to conform to theSupermine™ document storage numbering system.

[0187] When the document is received a signal is returned to theDocument Distribution Control Unit FIG. 11, (41). This signal activatesa program that enters the Data Request File, FIG. 11 (40) that containsthe modified client data requests, selects the oldest data request onfile, and copies it into the Data Request Register segment of the DataRequest/Received Document Comparitor FIG. 11, (41-b). The Comparitor isactivated by a “document received” signal from the Retrieved DocumentHeader segment of the Comparitor (41-b). It is programmed toelectronically compare the header of the retrieved document with themodified data request register in the Data Request segment of thecomparitor.

[0188] If no match is found, a signal is sent from the Comparitor FIG.11 (41-b) to the Document Distribution Control, FIG. 11, (41). TheDocument Distribution Control FIGS. 1A-1 to 1A-3, “marks” the first DataRequest selected (makes it unavailable to the scanner) and enters theData Request register segment of the Comparitor FIG. 11 (41-b) anddeletes the data stored there. The Data Request Register uses the emptyData Request register as a signal to enter the Data Request File, FIG.11, (40) and copy the next oldest Data Request into the Data RequestRegister of the comparitor FIG. 11, (41-b). When the “data received”signal is returned to the Document Distribution Control, FIG. 11, (41)the next comparison of the Data Request numerals with those of theRetrieved Document header begins. This comparison, deletion, insertion,comparison loop continues until a match is found.

[0189] When the match between the header of the retrieved document andthe modified client data request is found, a “match found” signal issent to the Document Distribution Control FIG. 11, (41) which uses thatsignal to activate the control program to enter the Data Request File,FIG. 11 (40) and select and copy the active data request register FIG.11, (40-a) into the Document Distribution Center FIGS. 12A-12B, (42).This is the client's original unmodified FIG. 7A data request. TheDocument Distribution Center FIGS. 12A-12B, (42) has registers connectedin a series, parallel configuration that enables the content of tworegisters to be combined or transmitted singly. The DocumentDistribution Center FIGS. 12A-12B, (42) returns a “document received”signal to the Document Distribution Control FIG. 11, (41) which usesthat signal to activate the control program to enter the RetrievedDocument File, FIG. 11, (41-a) and select and copy the contents of thatregister into the Document Distribution Center FIGS. 12A-12B, (42). TheDocument Distribution Center FIGS. 12A-12B (42) returns a “documentreceived” signal to the Document Distribution Control FIG. 11, (41)which uses that signal to delete the content of the Data Request Fileregisters FIG. 11, (40-a and -b) and makes them available for the nextclient data request.

[0190] The Document Distribution Center FIGS. 12A-12B, (42) contains acontrol program that combines the copy of the original client's datarequest with the retrieved document. It also records the number of theCombined Data Request/Retrieved Document register that stored thedocument and data request in the Document Distribution Center registersFIGS. 12A-12B (42-a and 42-b). The program then sends the client datarequest, in its original form, and the number of the registers thatcontain entire retrieved document into a visual display. This is toenable the Document Distribution Clerk to enter the correct CombinedData Request/Retrieved Document Register FIGS. 12A-12B, (42) anddownload the content of those registers to the client. In addition,provision is made in the program to activate an audible/visual signal toalert Supermine™ staff personnel to down load the client's data requestand the retrieved document to the client.

[0191] The Supermine™ Inventory in the Document Retrieval Process Asexplained earlier, every document stored in the Supermine™ warehouse hasthe header for that document stored in the Supermine™ inventory unitFIG. 14 or FIGS. 1A-1C, (37). These headers are inserted into theinventory unit in the order of the Supermine™ document number that wasassigned when the incoming document was received in the DataReceiver/Transmitters FIGS. 1A-1C, (9). This document number and thedate it was received in Supermine™ remains with the document header aslong as it is stored in Supermine™. This equips the inventory to be asource for Supermine™ clients and employees to determine whether theSupermine™ warehouse contains documents relating to a particularlocation or a specific subject matter. The Supermine™ document number(if known) is used to identify and retrieve the headers for the desireddocuments. Inserting the headers into the document retrieval process aspreviously described is the means of efficiently retrieving the completedocument.

[0192] Referring to FIG. 14, the Supermine™ inventory unit contains anInventory Search, Copy and Transport program FIG. 14, (37-a) that acceptincoming document headers from the Document Transport Control FIGS.6A-6B, (24). It assigns them to an inventory position by reading theheader beginning with the twenty-ninth through the fortieth registerpositions, the Supermine™ number slots. It then assigns the documentheader in an Inventory Document Header Register FIG. 14, (37-b) in theorder received. There are several ways the header can be retrieved fromthe Supermine™ Inventory unit by members of the staff, the industryanalyst or a level three client. First, the header can be retrieved ifthe staff, analyst or client knows one or more of the following sets ofnumbers concerning the document header needed:

[0193] 1. The Supermine™ document number.

[0194] 2. The Supermine™ Template numbers.

[0195] 3. The location that pertains to the subject matter needed (seeFIG. 2).

[0196] 4. The date the document was received. However, if this is theonly identification number known and is the only number used, therequesting party will get the header for every document received on thatdate.

[0197] 5. The subject matter that was used in the document.

[0198] If none of the above numbers are known, there is a way to obtainsufficient numbers to retrieve the header. As previously explained thereis a Location Template and an Industry Template made for every documentthat is stored in Supermine™. Using our example of a person needing toknow how to plant string beans in Pasadena, Tex. the numbers needed toobtain the header can be constructed easily. Referring to FIG. 2, if thelocation only is known, the thirteen digits that make up the locationwarehouse assignments can be determined by following the construction ofthe location template columns. That is all that is needed to recover thedocument header from the Supermine™ Inventory. However if that is allthat is used, the requesting party will get every document that isstored concerning Pasadena, Tex.

[0199] Referring to FIG. 3, using the column numbers of the template isa means of easily deriving the numbers for a specific industryapplication. When these numbers, along with the location numbers areknown, the document header can be obtained from the Supermine™ Inventoryin the following manner. The requesting party addresses the Supermine™Inventory Unit FIG. 14, (37) and types in a “search” command followed bythe numbers relative to the document header needed.

[0200] A brief review of FIG. 4 would be helpful at this point. Noticethat the last five digits of the register are not used. These can beused for the last five digits of the requesting party's personalSupermine™ number or the level three client's account number. At thistime the requesting party puts the last five digits of his/her personalSupermine™ identification number in the register. A level three clientwill use the last five digits of the account number. If more digits areneeded, those normally used for the date the document was received couldbe used in this instance because that date is not important if there areenough numbers to identify the header needed. However a modification ofthe Inventory Search Program, FIG. 14, (37-a) will be required to openthe Inventory Search Register FIG. 14 (37-a) slots in the properidentification number sequence.

[0201] The Inventory Search Program FIG. 14, (37) inserts these numbersinto the Inventory Search Register FIG. 14, (37-a). After afive-millisecond delay to assure that no more numbers follow, theInventory Search Register sends a “numbers received” signal to theInventory Search Program FIG. 14, (37-a). This starts a scanner that isprogrammed to “look” into the Inventory Search Register beginning withthe twenty-ninth register slot continuing through the fortieth registerslot looking for the Supermine™ document header number. If that numberis present, the Search Program sends a command to the scanner to find amatching number. The scanner searches down the list of Supermine™document header numbers until the match is found. See FIG. 14, (37-b)Document Number 0000000385. It then sends a “match” found signal to theInventory Transport Program FIG. 14, (37-a) to copy the content of theentire header where the matching Supermine™ document number was foundinto the Inventory Transport Buffer FIG. 14, (37-d). When the documentheader is received, after a five-millisecond delay a “header received”signal is sent to the Inventory Search Transport Program, FIG. 14,(37-a). This activates the Transport program to copy the content of theInventory Search Register FIG. 14, (37-a) into the Inventory ComparitorBuffer FIG. 14, (37-c). There all numbers are scanned to verify a matchbetween the individual register slots in both registers. In this case,only the slots for the Supermine™ document header number will be filled.If these match, a “header match” signal is sent from the InventoryComparitor FIG. 14, (37-c) to the Inventory Transport Program FIG. 14,(37-a). This activates this unit to enter the Inventory Search registerFIG. 14, (37-a) and copy the content of the last five inventory searchregister slots into the last five register slots in the InventoryTransport Buffer. This is the means of getting the header to therequesting party. When the Inventory Transport Buffer FIG. 14, (37-d)signals that the last five slots in its register are filled, theInventory Transport Program, FIG. 14, (37-a) begins the comparitor scanbetween the content of the inventory Comparitor Buffer FIG. 14, (37-c)and the Inventory Transport Buffer FIG. 14, (37-d). If there is a matchin the numbers, a signal is sent to the Inventory Transport Program FIG.14, (37-a) to copy the content of the Inventory Transport Buffer FIG.14, (37-d) into the Retrieved Document header and text section of theDocument Distribution Center FIGS. 12A-12B, (42-b).

[0202] In the event the requesting party does not know the Supermine™document header number and uses identification from the templates aspreviously described, the operation sequence is the same with theexception that all slots in both the Inventory Comparitor Buffer FIG.14, (37-c) and the Inventory Transport Buffer FIG. 14, (37-d) arescanned and compared to verify a match between the request and theoffered document header. If there is substantial match, a “header match”signal is sent from the Inventory Comparitor FIG. 14, (37-c) to theInventory Transport Program FIG. 14, (37-a) to copy the content of theInventory Transport Buffer FIG. 14, (37-d) into the DocumentDistribution Center FIGS. 12A-12B, (42-b) where the clerk uses therequesting party's Supermine™ identification number or the client'saccount number to down load the retrieved header to the requestingparty.

An Alternative Embodiment Based on Latitude/Longitude

[0203] To demonstrate the scope and flexibility of the Supermine™ Systemthe practicability of utilizing Latitude and Longitude as the basis of adata acquisition, storage and distribution is described. The basicoperation of Supermine™ would remain unchanged. There would be nochanges required in the Block Diagrams FIGS. 1A-1C and 1A-1 to 1A-3 thatdescribe the flow of data as it is processed into and from Supermine™.Referring to FIG. 2, the transition from the use of Nations, States,Cities and Communities as Location identification points to the use ofLongitude and Latitude would require that the Nation, State, Cities andCommunities shown on location template FIG. 2 be replaced with Longitudeand Latitude in the various columns. The column configuration and scanlines would remain basically the same.

[0204] Since Supermine™ is a numerical system, there would be minorchanges in the listing of Longitude and Latitude in that all referencesto North, South, East or West would entail assigning an arbitrary numberto those directional letters. For example North could=1, South could=2,East could=3 and West could=4. The same change could be made for anylettered descriptions such as degrees and minutes. The Supermine™analyst could make these transitions as the latitude and longitudelocation templates are prepared.

[0205] To reduce the size of the location templates and thecorresponding digit registers, it would be more practical to constructtwo location templates, one for the Northern Hemisphere and one for theSouthern Hemisphere with the equator being the natural dividing linebetween them. As stated in the basic disclosure, the design of the DataReceiver/Key Word Comparitors FIGS. 13A-13C, the Location and IndustryKey Word Template Storage Unit, FIGS. 5A-5B, and other Supermine™components make it possible to associate any location point with anyindustry. This means that any longitude and latitude point could beassociated with any weather, soil, tide data, or topographical data thatcan be described on an industry template. More importantly, data for anypoint on the Globe could be polled in Supermine™ to obtain a widevariety of information using the same data retrieval process previouslydescribed in the disclosure.

[0206] Referring to FIG. 4, notice the position of the location registerslots. The first thirteen slots plus one spacing slot would contain thelongitude and latitude digits from a Global Positioning System thatdescribes a precise point on the Globe down to a few feet. The nextthirteen register slots plus one spacing slot used for the industry datawould depict any information relative to that particular point on theearth. There would be no changes required in the basic design of theSupermine™ register assignments. Referring to FIGS. 5A-5B, convertingSupermine™ location identification to a longitude and latitude systemwould require some modification of the Location and Industry Key WordTemplate Storage Unit FIGS. 5A-5B. The sheer size of the system wouldargue for greatly increasing the size of the language section FIGS.5A-5B (30) and making provision to differentiate between inhabited landareas islands and oceans. Changing the terminology in the LocationTemplate section of FIGS. 5A-5B would accomplish this objective.Obviously the change to a latitude and longitude location identificationsystem would greatly increase the size of FIGS. 5A-5B. This is thereason for suggesting a two-hemisphere location template.

[0207] Careful construction of a modified FIGS. 5A-5B would enableclients to enter a request into Supermine™ with only GPS longitude andlatitude numbers and obtain data relative to that particular point onthe earth. These data would be limited only to the amount of detailincluded on the equivalent of the industry templates FIG. 3. Referringto FIGS. 10A-10b, thoughtful consideration of this hypothetical storageunit will clearly demonstrate the feasibility of using longitude andlatitude for the Supermine™ location assignments. The example shown usesthe United States data storage unit to show the configuration of theindustry data relative to the location storage shelves for every state.The position of the Location Template Columns FIGS. 10A-10B (15) couldreadily be converted into longitude and latitude by simply adding thenecessary number of shelves to accommodate the listing of the latitudeand longitude numbers. The size of the registers would dictate atwo-hemisphere storage unit, FIGS. 10 and 10A-10B. The presentassignment of the industry data would remain the same. However theSupermine™ analyst who prepares the industry templates would be requiredto use a much broader scope of knowledge to get oceanographic andtopological data for uninhabited and oceanic locations.

[0208] It should be understood that the above changes to the Supermine™system are only adaptations and modifications to the basic system thatremains the same. This adaptation is included in the disclosure todemonstrate that the Supermine™ system has the potential to make majorchanges in the way industry acquires, process and distributes datathroughout the world. Although the invention has been described in termsof specific embodiments set forth in detail, it should be understoodthat the embodiments disclosed are for illustrative purposes and theinvention is not limited to the aforementioned embodiments. Alternativeembodiments will be apparent to those skilled in the art in view of thedisclosure. Accordingly, modifications are contemplated which can bemade without departing from the spirit of the described invention.

1-25. (Canceled)
 26. A method of operating an electronic switchcomprising: receiving a plurality of data objects; storing the dataobject in a plurality of data comparitors; receiving a first signalindicating that all of the comparitors are busy; receiving an additionaldata object; providing a holding area for data objects; storing theadditional data object in the holding area; receiving a second signalindicating that a comparator is free; and storing the additional dataobject in the comparator.
 27. A method of storing data in a datawarehouse, comprising: receiving a plurality of data objects storing thedata objects in a plurality of data comparitors, such that thecomparitors are busy; receiving an additional data object while thecomparitors are busy; storing the additional data object in a holdingarea; and transferring the additional data object from the holding areato one of the comparitors when the comparitor is no longer busy.
 28. Anapparatus for storing data in a data warehouse, comprising: a datareceiver that receives a data object; a plurality of comparitors, eachfor indexing a received data object for storage in the data warehouse; abusy transfer switch that determines if any of the comparitors isavailable for indexing a data object; and a holding file thattemporarily holds a data object when the comparitors are not availablefor indexing.
 29. The apparatus according to claim 28, furthercomprising: a polling unit that searches for a data object to be storedin the data warehouse.
 30. The apparatus according to claim 28, furthercomprising: a sensing device, associated with the comparitors, forsending a signal to the busy transfer switch indicating that thecomparitors are not available for indexing.
 31. The apparatus accordingto claim 30, wherein the busy transfer switch receives the signal fromthe sensing device and thereby directs the received data object to theholding file.
 32. The apparatus according to claim 31, wherein thesensing device sends a second signal to the busy transfer switchindicating that a comparitor is available for indexing.
 33. Theapparatus according to claim 32, wherein the busy transfer switchdirects that the data object in the holding file to the comparitoravailable for indexing.
 34. A method of storing a data object in a datawarehouse, comprising: receiving a data object; identifying a locationrelated to the data object; identifying an industry related to the dataobject; and indexing the data object in the data warehouse based on theidentified location and industry.
 35. A method of retrieving a dataobject stored in a data warehouse, comprising: receiving a request for adata object stored in the data warehouse; parsing the request toidentify a location and an industry related to the request; andretrieving the data object from the data warehouse based on theidentified location and industry.